Claudio Canuto · Anita Tabacco


Mathematical Analysis I


Second Edition


UNITEXT — La Matematica per il 3+2 


Volume 84 


For further volumes: 
http://www.springer.com/series/5418 


Claudio Canuto - Anita Tabacco 


Mathematical Analysis I 


Second Edition 


Springer


Claudio Canuto
Department of Mathematical Sciences
Politecnico di Torino
Torino, Italy

Anita Tabacco
Department of Mathematical Sciences
Politecnico di Torino
Torino, Italy


UNITEXT - La Matematica per il 3+2 
ISSN 2038-5722 ISSN 2038-5757 (electronic) 


ISBN 978-3-319-12771-2 ISBN 978-3-319-12772-9 (eBook) 
DOI 10.1007/978-3-319-12772-9 
Springer Cham Heidelberg New York Dordrecht London 


Library of Congress Control Number: 2014951876 


© Springer International Publishing Switzerland 2015 

This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part 
of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations,
recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or
information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar
methodology now known or hereafter developed. Exempted from this legal reservation are brief
excerpts in connection with reviews or scholarly analysis or material supplied specifically for the purpose
of being entered and executed on a computer system, for exclusive use by the purchaser of the work. 
Duplication of this publication or parts thereof is permitted only under the provisions of the Copyright 
Law of the Publisher’s location, in its current version, and permission for use must always be obtained 
from Springer. Permissions for use may be obtained through RightsLink at the Copyright Clearance 
Center. Violations are liable to prosecution under the respective Copyright Law. 

The use of general descriptive names, registered names, trademarks, service marks, etc. in this
publication does not imply, even in the absence of a specific statement, that such names are exempt from the
relevant protective laws and regulations and therefore free for general use. 

While the advice and information in this book are believed to be true and accurate at the date of
publication, neither the authors nor the editors nor the publisher can accept any legal responsibility for
any errors or omissions that may be made. The publisher makes no warranty, express or implied, with 
respect to the material contained herein. 


Cover Design: Simona Colombo, Giochi di Grafica, Milano, Italy 
Files provided by the Authors 


Springer is a part of Springer Science+Business Media (www.springer.com) 


Preface 


This textbook is meant to help students acquire the basics of Calculus in curricula 
where mathematical tools play a crucial part (so Engineering, Physics, Computer 
Science and the like). The fundamental concepts and methods of Differential and 
Integral Calculus for functions of one real variable are presented with the primary 
purpose of letting students assimilate their effective employment, but with critical 
awareness. The general philosophy inspiring our approach has been to simplify the 
system of notions available prior to the university reform; at the same time we 
wished to maintain the rigorous exposition and avoid the trap of compiling a mere 
formulary of ready-to-use prescriptions. 

In view of the current Programme Specifications, the organization of a first 
course in Mathematics often requires making appropriate choices about lecture
content, the comprehension level required from the recipients, and which kind 
of language to use. From this point of view, the treatise is ‘stratified’ in three 
layers, each corresponding to increasingly deeper engagement by the user. The 
intermediate level corresponds to the contents of the eleven chapters of the text. 
Notions are first presented in a naive manner, and only later defined precisely. 
Their features are discussed, and computational techniques related to them are 
exhaustively explained. Besides this, the fundamental theorems and properties are 
followed by proofs, which are easily recognisable by the font’s colour. 

At the elementary level the proofs and the various remarks should be skipped. 
For the reader’s sake, essential formulas, and also those judged important, have
been highlighted in blue and gray, respectively. Some tables, placed both throughout
the text and at the end of the book, collect the most useful formulas. It was not our
desire to create a hierarchy of sorts for theorems, but rather to leave the instructor
free to make up his or her own mind in this respect.

The deepest-reaching level relates to the contents of the five appendices and 
enables the strongly motivated reader to explore further into the subject. We 
believe that the general objectives of the Programme Specifications are in line with 
the fact that willing and able pupils will build a solid knowledge, in the tradition 
of the best academic education. The eleven chapters contain several links to the 
different appendices, where the reader will find complements to, and insight into,
various topics. In this fashion every result that is stated possesses a corresponding
proof. 

To make the approach to the subject less harsh, and all the more gratifying, 
we have chosen an informal presentation in the first two chapters, where relevant 
definitions and properties are typically part of the text. From the third chapter 
onwards they are highlighted by the layout more discernibly. Some definitions and 
theorems are intentionally not stated in the most general form, so as to favour
a quick understanding. For this reason a wealth of examples is routinely added
along the way right after statements, and the same is true for computational 
techniques. Several remarks enhance the presentation by underlining, in particular, 
special cases and exceptions. Each chapter ends with a large number of exercises 
that allow one to test on the spot how solid one’s knowledge is. Exercises are 
grouped according to the chapter’s major themes and presented in increasing order 
of difficulty. All problems are solved, and at least half of them chaperone the reader 
to the solution. 

We have adopted the following graphical conventions for the constituent building
blocks: definitions appear on a gray background, theorems’ statements on blue,
a vertical coloured line marks examples, and boxed exercise numbers, like 12., indicate
that the complete solution is provided.

We wish to dedicate this volume to Professor Guido Weiss of Washington 
University in St. Louis, a master in the art of teaching. Generations of students 
worldwide have benefited from Guido’s own work as a mathematician; we hope 
that his own clarity is at least partly reflected in this textbook. 


This second English edition reflects the latest version of the Italian book, which
has been in use for over a decade and has been extensively and successfully tested at
the Politecnico in Turin and in other Italian Universities. We are grateful to the
many colleagues and students whose advice, suggestions and observations have 
allowed us to reach this result. Special thanks are due to Dr. Simon Chiossi, for 
the careful and effective work of translation. 

Finally, we wish to thank Francesca Bonadei (Executive Editor, Mathematics
and Statistics, Springer Italia) for her encouragement and support in the
preparation of this textbook.


Torino, August 2014 Claudio Canuto, Anita Tabacco 


Contents


1 Basic notions
  1.1 Sets
  1.2 Elements of mathematical logic
    1.2.1 Connectives
    1.2.2 Predicates
    1.2.3 Quantifiers
  1.3 Sets of numbers
    1.3.1 The ordering of real numbers
    1.3.2 Completeness of R
  1.4 Factorials and binomial coefficients
  1.5 Cartesian product
  1.6 Relations in the plane
  1.7 Exercises
    1.7.1 Solutions

2 Functions
  2.1 Definitions and first examples
  2.2 Range and pre-image
  2.3 Surjective and injective functions; inverse function
  2.4 Monotone functions
  2.5 Composition of functions
    2.5.1 Translations, rescalings, reflections
  2.6 Elementary functions and properties
    2.6.1 Powers
    2.6.2 Polynomial and rational functions
    2.6.3 Exponential and logarithmic functions
    2.6.4 Trigonometric functions and inverses
  2.7 Exercises
    2.7.1 Solutions

3 Limits and continuity I
  3.1 Neighbourhoods
  3.2 Limit of a sequence
  3.3 Limits of functions; continuity
    3.3.1 Limits at infinity
    3.3.2 Continuity. Limits at real points
    3.3.3 One-sided limits; points of discontinuity
    3.3.4 Limits of monotone functions
  3.4 Exercises
    3.4.1 Solutions

4 Limits and continuity II
  4.1 Theorems on limits
    4.1.1 Uniqueness and sign of the limit
    4.1.2 Comparison theorems
    4.1.3 Algebra of limits. Indeterminate forms of algebraic type
    4.1.4 Substitution theorem
  4.2 More fundamental limits. Indeterminate forms of exponential type
  4.3 Global features of continuous maps
  4.4 Exercises
    4.4.1 Solutions

5 Local comparison of functions. Numerical sequences and series
  5.1 Landau symbols
  5.2 Infinitesimal and infinite functions
  5.3 Asymptotes
  5.4 Further properties of sequences
  5.5 Numerical series
    5.5.1 Positive-term series
    5.5.2 Alternating series
  5.6 Exercises
    5.6.1 Solutions

6 Differential calculus
  6.1 The derivative
  6.2 Derivatives of the elementary functions. Rules of differentiation
  6.3 Where differentiability fails
  6.4 Extrema and critical points
  6.5 Theorems of Rolle, Lagrange, and Cauchy
  6.6 First and second finite increment formulas
  6.7 Monotone maps
  6.8 Higher-order derivatives
  6.9 Convexity and inflection points
    6.9.1 Extension of the notion of convexity
  6.10 Qualitative study of a function
    6.10.1 Hyperbolic functions
  6.11 The Theorem of de l’Hôpital
    6.11.1 Applications of de l’Hôpital’s theorem
  6.12 Exercises
    6.12.1 Solutions

7 Taylor expansions and applications
  7.1 Taylor formulas
  7.2 Expanding the elementary functions
  7.3 Operations on Taylor expansions
  7.4 Local behaviour of a map via its Taylor expansion
  7.5 Exercises
    7.5.1 Solutions

8 Geometry in the plane and in space
  8.1 Polar, cylindrical, and spherical coordinates
  8.2 Vectors in the plane and in space
    8.2.1 Position vectors
    8.2.2 Norm and scalar product
    8.2.3 General vectors
  8.3 Complex numbers
    8.3.1 Algebraic operations
    8.3.2 Cartesian coordinates
    8.3.3 Trigonometric and exponential form
    8.3.4 Powers and nth roots
    8.3.5 Algebraic equations
  8.4 Curves in the plane and in space
  8.5 Functions of several variables
    8.5.1 Continuity
    8.5.2 Partial derivatives and gradient
  8.6 Exercises
    8.6.1 Solutions

9 Integral calculus I
  9.1 Primitive functions and indefinite integrals
  9.2 Rules of indefinite integration
    9.2.1 Integrating rational maps
  9.3 Definite integrals
  9.4 The Cauchy integral
  9.5 The Riemann integral
  9.6 Properties of definite integrals
  9.7 Integral mean value
  9.8 The Fundamental Theorem of integral calculus
  9.9 Rules of definite integration
    9.9.1 Application: computation of areas
  9.10 Exercises
    9.10.1 Solutions

10 Integral calculus II
  10.1 Improper integrals
    10.1.1 Unbounded domains of integration
    10.1.2 Unbounded integrands
  10.2 More improper integrals
  10.3 Integrals along curves
    10.3.1 Length of a curve and arc length
  10.4 Integral vector calculus
  10.5 Exercises
    10.5.1 Solutions

11 Ordinary differential equations
  11.1 General definitions
  11.2 First order differential equations
    11.2.1 Equations with separable variables
    11.2.2 Linear equations
    11.2.3 Homogeneous equations
    11.2.4 Second order equations reducible to first order
  11.3 Initial value problems for equations of the first order
    11.3.1 Lipschitz functions
    11.3.2 A criterion for solving initial value problems
  11.4 Linear second order equations with constant coefficients
  11.5 Exercises
    11.5.1 Solutions

Appendices
  A.1 The Principle of Mathematical Induction
  A.2 Complements on limits and continuity
    A.2.1 Limits
    A.2.2 Elementary functions
    A.2.3 Napier’s number
  A.3 Complements on the global features of continuous maps
    A.3.1 Subsequences
    A.3.2 Continuous functions on an interval
    A.3.3 Uniform continuity
  A.4 Complements on differential calculus
    A.4.1 Derivation formulas
    A.4.2 De l’Hôpital’s Theorem
    A.4.3 Convex functions
    A.4.4 Taylor formulas
  A.5 Complements on integral calculus
    A.5.1 The Cauchy integral
    A.5.2 The Riemann integral
    A.5.3 Improper integrals

Tables and Formulas


1 


Basic notions 


In this introductory chapter some mathematical notions are presented rapidly, 
which lie at the heart of the study of Mathematical Analysis. Most should already 
be known to the reader, perhaps in a more thorough form than in the following 
presentation. Other concepts may be completely new, instead. The treatise aims 
at fixing much of the notation and mathematical symbols frequently used in the 
sequel. 


1.1 Sets 
We shall denote sets mainly by upper case letters $X, Y, \ldots$, while for the members
or elements of a set lower case letters $x, y, \ldots$ will be used. When an element x is
in the set X one writes $x \in X$ (‘x is an element of X’, or ‘the element x belongs
to the set X’), otherwise the symbol $x \notin X$ is used.


The majority of sets we shall consider are built starting from sets of numbers. 
Due to their importance, the main sets of numbers deserve special symbols, namely: 


$\mathbb{N}$ = set of natural numbers
$\mathbb{Z}$ = set of integer numbers
$\mathbb{Q}$ = set of rational numbers
$\mathbb{R}$ = set of real numbers
$\mathbb{C}$ = set of complex numbers.


The definition and main properties of these sets, apart from the last one, will 
be briefly recalled in Sect. 1.3. Complex numbers will be dealt with separately in 
Sect. 8.3. 


Let us fix a non-empty set X, considered as ambient set. A subset A of X
is a set all of whose elements belong to X; one writes $A \subseteq X$ (‘A is contained,
or included, in X’) if the subset A is allowed to possibly coincide with X, and
$A \subset X$ (‘A is properly contained in X’) in case A is a proper subset of X, that
is, if it does not exhaust the whole X. From the intuitive point of view it may
be useful to represent subsets as bounded regions in the plane using the so-called
Venn diagrams (see Fig. 1.1, left).


Figure 1.1. Venn diagrams (left) and complement (right)


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed.,
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_1,
© Springer International Publishing Switzerland 2015


A subset can be described by listing the elements of X which belong to it

$$A = \{x, y, \ldots, z\};$$

the order in which elements appear is not essential. This clearly restricts the use
of such notation to subsets with few elements. More often the notation

$$A = \{x \in X \mid p(x)\} \quad\text{or}\quad A = \{x \in X : p(x)\}$$


will be used (read ‘A is the subset of elements x of X such that the condition p(x)
holds’); p(x) denotes the characteristic property of the elements of the subset, i.e.,
the condition that is valid for the elements of the subset only, and not for other
elements. For example, the subset A of natural numbers smaller than or equal to 4
may be denoted

$$A = \{0, 1, 2, 3, 4\} \quad\text{or}\quad A = \{x \in \mathbb{N} \mid x \le 4\}.$$

The expression p(x) = ‘$x \le 4$’ is an example of predicate, which we will return to
in the following section.

The collection of all subsets of a given set X forms the power set of X, and
is denoted by $\mathcal{P}(X)$. Obviously $X \in \mathcal{P}(X)$. Among the subsets of X there is the
empty set, the set containing no elements. It is usually denoted by the symbol
$\emptyset$, so $\emptyset \in \mathcal{P}(X)$. All other subsets of X are proper and non-empty.

Consider for instance $X = \{1, 2, 3\}$ as ambient set. Then

$$\mathcal{P}(X) = \{\emptyset, \{1\}, \{2\}, \{3\}, \{1,2\}, \{1,3\}, \{2,3\}, X\}.$$

Note that X contains 3 elements (it has cardinality 3), while $\mathcal{P}(X)$ has $8 = 2^3$
elements, hence has cardinality 8. In general if a finite set X (a set with a finite
number of elements) has cardinality n, the power set of X has cardinality $2^n$.
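The count of subsets can be checked on small cases by brute force. The following Python sketch is only an illustration: the helper name `power_set` is our own choice, and subsets are stored as `frozenset` objects so that they can themselves be elements of a set.

```python
from itertools import combinations

def power_set(X):
    """Return the power set of the finite set X as a set of frozensets."""
    elements = list(X)
    return {
        frozenset(subset)
        for k in range(len(elements) + 1)        # subsets of size 0, 1, ..., n
        for subset in combinations(elements, k)
    }

X = {1, 2, 3}
P = power_set(X)

# The empty set and X itself both belong to P(X).
assert frozenset() in P and frozenset(X) in P
# A set of cardinality n has a power set of cardinality 2**n.
assert len(P) == 2 ** len(X)
```

The enumeration grows exponentially with the cardinality of X, so this check is practical only for small sets.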




Starting from one or more subsets of X, one can define new subsets by means
of set-theoretical operations. The simplest operation consists in taking the complement:
if A is a subset of X, one defines the complement of A (in X) to be the
subset

$$\mathcal{C}A = \{x \in X \mid x \notin A\}$$

made of all elements of X not belonging to A (Fig. 1.1, right).

Sometimes, in order to underline that complements are taken with respect to
the ambient space X, one uses the more precise notation $\mathcal{C}_X A$. The following
properties are immediate:

$$\mathcal{C}X = \emptyset, \qquad \mathcal{C}\emptyset = X, \qquad \mathcal{C}(\mathcal{C}A) = A.$$

For example, if $X = \mathbb{N}$ and A is the subset of even numbers (multiples of 2), then
$\mathcal{C}A$ is the subset of odd numbers.


Given two subsets A and B of X, one defines the intersection of A and B as the
subset

$$A \cap B = \{x \in X \mid x \in A \text{ and } x \in B\}$$

containing the elements of X that belong to both A and B, and the union of A and
B as the subset

$$A \cup B = \{x \in X \mid x \in A \text{ or } x \in B\}$$

made of the elements that are either in A or in B (this is meant non-exclusively,
so it includes elements of $A \cap B$), see Fig. 1.2.


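Figure 1.2. Intersection and union of sets


We recall some properties of these operations.


i) Boolean properties:

$$A \cap \mathcal{C}A = \emptyset, \qquad A \cup \mathcal{C}A = X;$$

ii) commutative, associative and distributive properties:

$$A \cap B = B \cap A, \qquad A \cup B = B \cup A,$$
$$(A \cap B) \cap C = A \cap (B \cap C), \qquad (A \cup B) \cup C = A \cup (B \cup C),$$
$$(A \cap B) \cup C = (A \cup C) \cap (B \cup C), \qquad (A \cup B) \cap C = (A \cap C) \cup (B \cap C);$$

iii) De Morgan laws:

$$\mathcal{C}(A \cap B) = \mathcal{C}A \cup \mathcal{C}B, \qquad \mathcal{C}(A \cup B) = \mathcal{C}A \cap \mathcal{C}B.$$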


Notice that the condition $A \subseteq B$ is equivalent to $A \cap B = A$, or $A \cup B = B$.
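The properties just recalled can be tested on concrete sets. The sketch below uses Python's built-in `set` type; the finite ambient set `X`, the particular subsets `A`, `B`, `C`, `D` and the helper names `complement` and `inclusion_forms_agree` are assumptions made for the illustration. Such a test checks the identities on one example only and does not replace a proof.

```python
X = set(range(12))                      # finite ambient set (assumption)
A = {x for x in X if x % 2 == 0}        # even elements of X
B = {x for x in X if x <= 5}
C = {x for x in X if x % 3 == 0}
D = {0, 2}                              # a subset of A

def complement(S, ambient=X):
    """Complement of S with respect to the ambient set."""
    return ambient - S

# i) Boolean properties
assert A & complement(A) == set()
assert A | complement(A) == X

# ii) distributive properties
assert (A & B) | C == (A | C) & (B | C)
assert (A | B) & C == (A & C) | (B & C)

# iii) De Morgan laws
assert complement(A & B) == complement(A) | complement(B)
assert complement(A | B) == complement(A) & complement(B)

def inclusion_forms_agree(S, T):
    """Check that S <= T, S & T == S and S | T == T have the same truth value."""
    return (S <= T) == (S & T == S) == (S | T == T)

assert all(inclusion_forms_agree(S, T) for S in (A, B, C, D) for T in (A, B, C, D))
```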


There are a couple of other useful operations. The first is the difference
between a subset A and a subset B, sometimes called relative complement
of B in A,

$$A \setminus B = \{x \in A \mid x \notin B\} = A \cap \mathcal{C}B$$

(read ‘A minus B’), which selects the elements of A that do not belong to B. The
second operation is the symmetric difference of the subsets A and B

$$A \,\Delta\, B = (A \setminus B) \cup (B \setminus A) = (A \cup B) \setminus (A \cap B),$$

which picks out the elements belonging either to A or B, but not both (Fig. 1.3).


For example, let $X = \mathbb{N}$, A be the set of even numbers and $B = \{n \in \mathbb{N} \mid n \le 10\}$
the set of natural numbers smaller than or equal to 10. Then $B \setminus A = \{1, 3, 5, 7, 9\}$
is the set of odd numbers smaller than 10, $A \setminus B$ is the set of even numbers larger
than 10, and $A \,\Delta\, B$ is the union of the latter two.


Figure 1.3. The difference $A \setminus B$ (left) and the symmetric difference $A \,\Delta\, B$ (right) of
two sets
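The example with even numbers can be reproduced numerically. Since the set of natural numbers is infinite, the sketch below works inside a finite window `N_WINDOW` (the numbers from 0 to 30), which is an assumption introduced only to make the computation possible; Python's operators `-` and `^` play the role of difference and symmetric difference.

```python
N_WINDOW = set(range(31))                    # finite stand-in for N (assumption)
A = {n for n in N_WINDOW if n % 2 == 0}      # even numbers in the window
B = {n for n in N_WINDOW if n <= 10}         # numbers smaller than or equal to 10

# B \ A: odd numbers smaller than 10
assert B - A == {1, 3, 5, 7, 9}
# A \ B: even numbers larger than 10 (within the window)
assert A - B == {n for n in N_WINDOW if n % 2 == 0 and n > 10}
# A \ B coincides with the intersection of A and the complement of B
assert A - B == A & (N_WINDOW - B)
# symmetric difference, in its two equivalent forms
assert A ^ B == (A - B) | (B - A) == (A | B) - (A & B)
```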




1.2 Elements of mathematical logic 


In Mathematical Logic a formula is a declarative sentence, or statement, the truth
or falsehood of which can be established. Thus within a certain context a formula
carries a truth value: True or False. The truth value can be variously represented,
for instance using the binary value of a memory bit (1 or 0), or by the state of
an electric circuit (open or closed). Examples of formulas are: ‘7 is an odd number’
(True), ‘$3 > \sqrt{12}$’ (False), ‘Venus is a star’ (False), ‘This text is written in English’
(True), et cetera. The statement ‘Milan is far from Rome’ is not a formula, at least
without further specifications on the notion of distance; in this respect ‘Milan is
farther from Rome than Turin’ is a formula. We shall indicate formulas by lower
case letters $p, q, r, \ldots$.


1.2.1 Connectives 


New formulas can be built from old ones using logic operations expressed by certain 
formal symbols, called connectives. 

The simplest operation is called negation: by the symbol $\neg p$ (spoken ‘not p’)
one indicates the formula whose truth value is True if p is False, and False if p
is True. For example if p = ‘7 is a rational number’, then $\neg p$ = ‘7 is an irrational
number’.

The conjunction of two formulas p and q is the formula $p \wedge q$ (‘p and q’),
which is true if both p and q are true, false otherwise. The disjunction of p and
q is the formula $p \vee q$ (‘p or q’); the disjunction is false if p and q are both false,
true in all other cases. Let for example p = ‘7 is a rational number’ and q = ‘7 is
an even number’; the formula $p \wedge q$ = ‘7 is an even rational number’ is false since
q is false, and $p \vee q$ = ‘7 is rational or even’ is true because p is true.


Many statements in Mathematics are of the kind ‘If p is true, then q is true’,
also read as ‘sufficient condition for q to be true is that p be true’, or ‘necessary
condition for p to be true is that q be true’. Such statements are different ways
of expressing the same formula $p \Rightarrow q$ (‘p implies q’, or ‘if p, then q’), called
implication, where p is the ‘hypothesis’ or ‘assumption’, q the ‘consequence’
or ‘conclusion’. By definition, the formula $p \Rightarrow q$ is false if p is true and q false,
otherwise it is always true. In other words the implication does not allow one to deduce
a false conclusion from a true assumption, yet does not exclude a true conclusion
being implied by a false hypothesis. Thus the statement ‘if it rains, I’ll take the
umbrella’ prevents me from going out without umbrella when it rains, but will not
interfere with my decision if the sky is clear.

By comparing truth values for every choice of p and q it is easy to check that the
formula $p \Rightarrow q$ has the same truth value as $\neg p \vee q$. Therefore the connective
$\Rightarrow$ can be expressed in terms of the basic connectives $\neg$ and $\vee$.
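The check amounts to comparing two columns of a truth table with four rows. In the Python sketch below the function `implies` encodes the definition of the implication (false only when the assumption is true and the conclusion false); its name, and the use of Python booleans as truth values, are choices made for this illustration.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    """Truth value of 'p implies q': false only when p is true and q is false."""
    return not (p and not q)

rows = []
for p, q in product([True, False], repeat=2):
    by_definition = implies(p, q)
    via_not_or = (not p) or q           # the formula (not p) or q
    rows.append((p, q, by_definition, via_not_or))
    print(p, q, by_definition, via_not_or)

# The last two columns of the truth table coincide on every row.
assert all(row[2] == row[3] for row in rows)
```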


Other frequent statements are structured as follows: ‘the conclusion q is true
if and only if the assumption p is true’, or ‘necessary and sufficient condition for a
true q is a true p’. Statements of this kind correspond to the formula $p \Leftrightarrow q$ (‘p is
(logically) equivalent to q’), called logic equivalence. A logic equivalence is true
if p and q are simultaneously true or simultaneously false, and false if the truth
values of p and q differ. An example is the statement ‘a natural number is odd if
and only if its square is odd’. The formula $p \Leftrightarrow q$ is the conjunction of $p \Rightarrow q$ and
$q \Rightarrow p$, in other words $p \Leftrightarrow q$ and $(p \Rightarrow q) \wedge (q \Rightarrow p)$ have the same truth value.
Thus the connective $\Leftrightarrow$ can be expressed by means of the basic connectives $\neg$, $\vee$
and $\wedge$.
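Both claims of this paragraph lend themselves to a quick numerical test. In the sketch below, `iff` and `implies` are hypothetical helper names; the statement about odd squares is tested only for the natural numbers below 1000, which supports the statement without proving it.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    return (not p) or q

def iff(p: bool, q: bool) -> bool:
    """Logic equivalence: true exactly when p and q have the same truth value."""
    return p == q

# The equivalence is the conjunction of the two implications.
for p, q in product([True, False], repeat=2):
    assert iff(p, q) == (implies(p, q) and implies(q, p))

def is_odd(n: int) -> bool:
    return n % 2 == 1

# 'n is odd if and only if its square is odd', on a finite sample of N.
assert all(iff(is_odd(n), is_odd(n * n)) for n in range(1000))
```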


The formula $p \Rightarrow q$ (a statement like ‘if p, then q’) can be expressed in various
other forms, all logically equivalent. These represent rules of inference to attain
the truth of the implication. For example, $p \Rightarrow q$ is logically equivalent to the
formula $\neg q \Rightarrow \neg p$, called contrapositive formula; symbolically

$$(p \Rightarrow q) \iff (\neg q \Rightarrow \neg p).$$

This is an easy check: $p \Rightarrow q$ is by definition false only when p is true and q false,
i.e., when $\neg q$ is true and $\neg p$ false. But this corresponds precisely to the falsehood
of $\neg q \Rightarrow \neg p$. Therefore we have established the following inference rule: in order
to prove that the truth of p implies the truth of q, one may assume that the
conclusion q is false and deduce from it the falsehood of the assumption p. To
prove for instance the implication ‘if a natural number is odd, then 10 does not
divide it’, we may suppose that the given number is a multiple of 10 and (easily)
deduce that the number must be even.
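The contrapositive rule can be verified over all truth values, and the example on multiples of 10 can be tested on a finite range. The sketch below is an illustration under assumptions: the helper names are ours, and the range from 1 to 1000 is an arbitrary sample of the natural numbers.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    return (not p) or q

# (p => q) and (not q => not p) agree for every choice of truth values.
assert all(
    implies(p, q) == implies(not q, not p)
    for p, q in product([True, False], repeat=2)
)

def is_odd(n: int) -> bool:
    return n % 2 == 1

def divisible_by_10(n: int) -> bool:
    return n % 10 == 0

sample = range(1, 1001)
# Direct form: if n is odd, then 10 does not divide n.
direct = all(implies(is_odd(n), not divisible_by_10(n)) for n in sample)
# Contrapositive form: if 10 divides n, then n is even.
contrapositive = all(implies(divisible_by_10(n), not is_odd(n)) for n in sample)
assert direct and contrapositive
```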


A second inference rule is the so-called proof by contradiction, which we will
sometimes use in the textbook. This is expressed by

$$(p \Rightarrow q) \iff (p \wedge \neg q \Rightarrow \neg p).$$

In order to prove the implication $p \Rightarrow q$ one can proceed as follows: suppose p is
true and the conclusion q is false, and try to prove the initial hypothesis p false.
Since p is also true, we obtain a self-contradictory statement.

A more general form of the proof by contradiction is given by the formula

$$(p \Rightarrow q) \iff (p \wedge \neg q \Rightarrow r \wedge \neg r),$$

where r is an additional formula: the implication $p \Rightarrow q$ is equivalent to assuming
p true and q false, then deducing a simultaneously true and false statement r (note
that the formula $r \wedge \neg r$ is always false, whichever the truth value of r).

Finally, we mention a further rule of inference, called Principle of Mathematical
Induction, for which we refer to Appendix A.1, p. 427.
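Both forms of the proof by contradiction are equivalences between formulas, so they can be confirmed by running through the eight possible truth values of p, q and r. The following Python sketch does this; `implies` is a hypothetical helper, not notation from the text.

```python
from itertools import product

def implies(p: bool, q: bool) -> bool:
    return (not p) or q

checked = 0
for p, q, r in product([True, False], repeat=3):
    target = implies(p, q)
    # First form: (p and not q) implies not p.
    assert target == implies(p and not q, not p)
    # r and not r is false whatever the truth value of r.
    assert (r and not r) is False
    # General form: (p and not q) implies a contradiction.
    assert target == implies(p and not q, r and not r)
    checked += 1

assert checked == 8
```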


1.2.2 Predicates 


Let us now introduce a central concept. A predicate is an assertion or property
$p(x, \ldots)$ that depends upon one or more variables $x, \ldots$ belonging to suitable sets,
and which becomes a formula (hence true or false) whenever the variables are
fixed. Let us consider an example. If x is an element of the set of natural numbers,
the assertion p(x) = ‘x is an odd number’ is a predicate: p(7) is true, p(10) false,
etc. If x and y denote students of the Polytechnic of Turin, the statement p(x, y)
= ‘x and y follow the same lectures’ is a predicate.

Observe that the aforementioned logic operations can be applied to predicates
as well, and give rise to new predicates (e.g., $\neg p(x)$, $p(x) \vee q(x)$ and so on). This
fact, by the way, establishes a precise relation among the essential connectives
$\neg, \wedge, \vee$ and the set-theoretical operations of taking complements, intersection and
union. In fact, recalling the definition $A = \{x \in X \mid p(x)\}$ of subset of a given
set X, the ‘characteristic property’ p(x) of the elements of A is nothing else but
a predicate, which is true precisely for the elements of A. The complement $\mathcal{C}A$ is
thus obtained by negating the characteristic property

$$\mathcal{C}A = \{x \in X \mid \neg p(x)\},$$

while the intersection and union of A with another subset $B = \{x \in X \mid q(x)\}$ are
described respectively by the conjunction and the disjunction of the corresponding
characteristic properties:

$$A \cap B = \{x \in X \mid p(x) \wedge q(x)\}, \qquad A \cup B = \{x \in X \mid p(x) \vee q(x)\}.$$


The properties of the set-theoretical operations recalled in the previous section 
translate into similar properties enjoyed by the logic operations, which the reader 
can easily write down. 
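
The correspondence between connectives and set operations can be tried out on a small finite set. In the sketch below the ambient set X = {0, ..., 19} and the two predicates are choices made here purely for illustration.

```python
# Ambient set and two predicates (illustrative choices, not from the text)
X = set(range(20))


def p(x: int) -> bool:
    return x % 2 == 1      # 'x is an odd number'


def q(x: int) -> bool:
    return x < 7           # 'x is strictly less than 7'


A = {x for x in X if p(x)}
B = {x for x in X if q(x)}

# Negation, conjunction and disjunction of the characteristic properties
complement_A = {x for x in X if not p(x)}
intersection = {x for x in X if p(x) and q(x)}
union = {x for x in X if p(x) or q(x)}

print(complement_A == X - A, intersection == A & B, union == A | B)
```

Each comparison puts the set built from a logical connective next to the set produced by the corresponding built-in set operation.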


1.2.3 Quantifiers 


Given a predicate p(x), with the variable x belonging to a certain set X, one is
naturally led to ask whether p(x) is true for all elements x, or if there exists at
least one element x making p(x) true. When posing such questions we are actually
considering the formulas


∀x, p(x)    (read ‘for all x, p(x) holds’)


and


∃x, p(x)    (read ‘there exists at least one x, such that p(x) holds’).


If indicating the set to which x belongs becomes necessary, one writes
‘∀x ∈ X, p(x)’ and ‘∃x ∈ X, p(x)’. The symbol ∀ (‘for all’) is called universal
quantifier, and the symbol ∃ (‘there exists at least’) is called existential quantifier.
(Sometimes a third quantifier is used, ∃!, which means ‘there exists one and only
one element’ or ‘there exists a unique’.)

We wish to stress that putting a quantifier in front of a predicate transforms
the latter into a formula, whose truth value may then be determined. The predicate
p(x) = ‘x is strictly less than 7’, for example, yields the false formula ‘∀x ∈ N, p(x)’
(since p(8) is false, for example), while ‘∃x ∈ N, p(x)’ is true (e.g., x = 6 satisfies
the assertion).


The effect of negation on a quantified predicate must be handled with care.
Suppose for instance x indicates the generic student of the Polytechnic, and let p(x)
= ‘x is an Italian citizen’. The formula ‘∀x, p(x)’ (‘every student of the Polytechnic
has Italian citizenship’) is false. Therefore its negation ‘¬(∀x, p(x))’ is true, but
beware: the latter does not state that all students are foreign, rather that ‘there
is at least one student who is not Italian’. Thus the negation of ‘∀x, p(x)’ is
‘∃x, ¬p(x)’. We can symbolically write


¬(∀x, p(x)) ⟺ ∃x, ¬p(x).

Similarly, it is not hard to convince oneself of the logical equivalence


¬(∃x, p(x)) ⟺ ∀x, ¬p(x).


If a predicate depends upon two or more arguments, each of them may be
quantified. Yet the order in which the quantifiers are written can be essential.
Namely, two quantifiers of the same type (either universal or existential) can be
swapped without modifying the truth value of the formula; in other terms


∀x ∀y, p(x, y) ⟺ ∀y ∀x, p(x, y),    ∃x ∃y, p(x, y) ⟺ ∃y ∃x, p(x, y).


On the contrary, exchanging the places of different quantifiers usually leads to 
different formulas, so one should be very careful when ordering quantifiers. 

As an example, consider the predicate p(x, y) = ‘x ≥ y’, with x, y varying in the
set of natural numbers. The formula ‘∀x ∀y, p(x, y)’ means ‘given any two natural
numbers, each one is greater than or equal to the other’, clearly a false statement.
The formula ‘∀x ∃y, p(x, y)’, meaning ‘given any natural number x, there is a
natural number y smaller than or equal to x’, is true, just take y = x for instance.
The formula ‘∃x ∀y, p(x, y)’ means ‘there is a natural number x greater than or equal
to each natural number’, and is false: each natural number x admits a successor
x + 1 which is strictly bigger than x. Finally, ‘∃x ∃y, p(x, y)’ (‘there are at
least two natural numbers such that one is bigger than or equal to the other’) holds
trivially.
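
On a finite set, quantified formulas can be evaluated mechanically, with `all` playing the role of ∀ and `any` that of ∃. The sketch below is only an illustration under stated assumptions: the domain D = {0, ..., 5} and the predicate ‘x = y’ are chosen here, because on a finite set the predicate ‘x ≥ y’ would make ‘∃x ∀y’ true (a finite set has a largest element), unlike in N.

```python
D = range(6)  # finite domain, an illustrative assumption


def p(x: int, y: int) -> bool:
    return x == y


def q(x: int) -> bool:
    return x < 4


forall_forall = all(p(x, y) for x in D for y in D)        # for all x, for all y
forall_exists = all(any(p(x, y) for y in D) for x in D)   # for all x, exists y
exists_forall = any(all(p(x, y) for y in D) for x in D)   # exists x, for all y
exists_exists = any(p(x, y) for x in D for y in D)        # exists x, exists y

# Negating a universal formula gives an existential one, and vice versa
negation_of_forall = (not all(q(x) for x in D)) == any(not q(x) for x in D)
negation_of_exists = (not any(q(x) for x in D)) == all(not q(x) for x in D)

print(forall_forall, forall_exists, exists_forall, exists_exists)
print(negation_of_forall, negation_of_exists)
```

Swapping the nesting of `all` and `any` changes the outcome, which mirrors the warning about exchanging quantifiers of different type.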


1.3 Sets of numbers 


Let us briefly examine the main sets of numbers used in the book. The discussion 
is on purpose not exhaustive, since the main properties of these sets should already 
be known to the reader. 


The set N of natural numbers. This set has the numbers 0, 1, 2,... as elements.
The operations of sum and product are defined on N and enjoy the well-known
commutative, associative and distributive properties. We shall indicate by N₊ the
set of natural numbers different from 0


N₊ = N \ {0}.


A natural number n is usually represented in base 10 by the expansion
n = cₖ10ᵏ + cₖ₋₁10ᵏ⁻¹ + ··· + c₁10 + c₀, where the cᵢ’s are natural numbers from 0 to 9
called decimal digits; the expression is unique if one assumes cₖ ≠ 0 when n ≠ 0. We
shall write n = (cₖcₖ₋₁...c₁c₀)₁₀, or more simply n = cₖcₖ₋₁...c₁c₀. Any natural
number ≥ 2 may be taken as base, instead of 10; a rather common alternative is
2, known as binary base.
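
The digits of the expansion can be computed by repeated division by the base. The following is a minimal sketch; the function names `digits` and `from_digits` are introduced here for illustration.

```python
def digits(n: int, base: int = 10) -> list[int]:
    """Digits c_k, ..., c_1, c_0 of the natural number n in the given base."""
    if n < 0 or base < 2:
        raise ValueError("need n >= 0 and base >= 2")
    if n == 0:
        return [0]
    found = []
    while n > 0:
        n, c = divmod(n, base)   # c is the next digit, from the right
        found.append(c)
    return found[::-1]           # most significant digit first


def from_digits(ds: list[int], base: int = 10) -> int:
    """Evaluate c_k*base^k + ... + c_1*base + c_0."""
    value = 0
    for c in ds:
        value = value * base + c
    return value


print(digits(2015), digits(13, base=2), from_digits([1, 1, 0, 1], base=2))
```

Because the leading digit produced by the loop is never zero for n ≠ 0, the list returned is the unique expansion mentioned above.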

Natural numbers can also be represented geometrically as points on a straight 
line. For this it is sufficient to fix a first point O on the line, called origin, and 
associate it to the number 0, and then choose another point P different from 
O, associated to the number 1. The direction of the line going from O to P is 
called positive direction, while the length of the segment OP is taken as unit for 
measurements. By marking multiples of OP on the line in the positive direction 
we obtain the points associated to the natural numbers (see Fig. 1.4). 


The set Z of integer numbers. This set contains the numbers 0, +1, −1,
+2, −2,... (called integers). The set N can be identified with the subset of Z
consisting of 0, +1, +2,... The numbers +1, +2,... (−1, −2,...) are called positive
integers (resp. negative integers). Sum and product are defined in Z, together with
the difference, which is the inverse operation to the sum.

An integer can be represented in decimal base z = ±cₖcₖ₋₁...c₁c₀. The geometric
picture of negative integers extends that of the natural numbers to the left
of the origin (Fig. 1.4).


The set Q of rational numbers. A rational number is the quotient, or ratio,
of two integers, the second of which (denominator) is non-zero. Without loss of
generality one can assume that the denominator is positive, whence each rational
number, or rational for simplicity, is given by

r = z/n,    with z ∈ Z and n ∈ N₊.

Moreover, one may also suppose the fraction is reduced, that is, z and n have no
common divisors. In this way the set Z is identified with the subset of rationals
whose denominator is 1. Besides sum, product and difference, the operation of
division between two rationals is defined on Q, so long as the second rational is
other than 0. This is the inverse to the product.


Figure 1.4. Geometric representation of numbers

A rational number admits a representation in base 10 of the kind
r = ±cₖcₖ₋₁···c₁c₀.d₁d₂···, corresponding to

r = ±(cₖ10ᵏ + cₖ₋₁10ᵏ⁻¹ + ··· + c₁10 + c₀ + d₁10⁻¹ + d₂10⁻² + ···).

The sequence of digits d₁, d₂,... written after the dot satisfies one and only one of
the following properties: i) all digits are 0 from a certain subscript i ≥ 1 onwards (in
which case one has a finite decimal expansion; usually the zeroes are not written),
or ii) starting from a certain point, a finite sequence of numbers not all zero,
called period, repeats itself over and over (infinite periodic decimal expansion;
the period is written once with a line drawn on top). For example the following
expressions are decimal expansions of rational numbers


35163/100 = 351.6300··· = 351.63    and    11579/925 = 12.51783783··· = 12.517̅8̅3̅.


The expansion of certain rationals is not unique. If a rational number has a finite
expansion, in fact, then it also has a never-ending periodic one obtained from the
former by reducing the right-most non-zero decimal digit by one unit, and adding
the period 9. The expansions 1.0 and 0.9̅ define the same rational number 1;
similarly, 8.357 and 8.3569̅ are equivalent representations of 8357/1000.

The geometric representation of a rational r = ±m/n is obtained by subdividing
the segment OP in n equal parts and copying the subsegment m times in the
positive or negative direction, according to the sign of r (see again Fig. 1.4).
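
The dichotomy between finite and periodic expansions can be seen at work in long division: the remainders are natural numbers smaller than the denominator, so either a remainder 0 appears or some remainder repeats, and from then on the digits repeat too. The following is a minimal sketch of this procedure for non-negative rationals; the function name `decimal_expansion` is a choice made here.

```python
def decimal_expansion(num: int, den: int) -> tuple[str, str, str]:
    """Return (integer part, non-repeating digits, period) of num/den.

    Assumes num >= 0 and den > 0. An empty period means a finite expansion.
    """
    if num < 0 or den <= 0:
        raise ValueError("need num >= 0 and den > 0")
    integer_part, remainder = divmod(num, den)
    found = []          # decimal digits produced so far
    seen = {}           # remainder -> position of the digit it produces
    while remainder != 0 and remainder not in seen:
        seen[remainder] = len(found)
        digit, remainder = divmod(remainder * 10, den)
        found.append(str(digit))
    if remainder == 0:
        return str(integer_part), "".join(found), ""
    start = seen[remainder]   # the period begins where this remainder first occurred
    return str(integer_part), "".join(found[:start]), "".join(found[start:])


print(decimal_expansion(35163, 100))
print(decimal_expansion(11579, 925))
```

For 11579/925 the remainder 725 occurs twice, which is what produces the period 783 after the digits 51.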


The set R of real numbers. Not every point on the line corresponds to a rational 
number in the above picture. This means that not all segments can be measured 
by multiples and sub-multiples of the unit of length, irrespective of the choice of 
this unit. 

It has been known since ancient times that the diagonal of a square is not
commensurable with the side, meaning that the length d of the diagonal is not a
rational multiple of the side’s length ℓ. To convince ourselves about this fact recall
Pythagoras’s Theorem. It considers any of the two triangles in which the diagonal
splits the square (Fig. 1.5), and states that


d² = ℓ² + ℓ²,    i.e.,    d² = 2ℓ².


Figure 1.5. Square with side ℓ and its diagonal


Calling p the ratio between the lengths of diagonal and side, we square d = pℓ and
substitute in the last relation to obtain p² = 2. The number p is called the square
root of 2 and it is indicated by the symbol √2.


Property 1.1 If the number p satisfies p² = 2, it must be non-rational.


Proof. By contradiction: suppose there exist two integers m and n, necessarily
non-zero, such that p = m/n. Assume m, n are relatively prime. Taking
squares we obtain m²/n² = 2, hence m² = 2n². Thus m² is even, which is to
say that m is even. For a suitable natural number k then, m = 2k. Using
this in the previous relation yields 4k² = 2n², i.e., n² = 2k². Then n²,
whence also n, is even. But this contradicts the fact that m and n have no
common factor, which comes from the assumption that p is rational.


Another relevant example of incommensurable lengths, known for centuries,
pertains to the length of a circle measured with respect to the diameter. In this
case as well, one can prove that the lengths of circumference and diameter are
not commensurable because the proportionality factor, known by the symbol π,
cannot be a rational number.


The set of real numbers is an extension of the rationals and provides a
mathematical model of the straight line, in the sense that each real number x can be
associated to a point P on the line uniquely, and vice versa. The former is called
the coordinate of P. There are several equivalent ways of constructing such an
extension. Without going into details, we merely recall that real numbers give rise to any
possible decimal expansion. Real numbers that are not rational, called irrational,
are characterised by having a non-periodic infinite decimal expansion, like


√2 = 1.4142135623731 ···    and    π = 3.1415926535897 ···


Rather than the actual construction of the set R, what is more interesting to us 
are the properties of real numbers, which allow one to work with the reals. Among 
these properties, we recall some of the most important ones. 


i) The arithmetic operations defined on the rationals extend to the reals with 
similar properties. 
ii) The order relation x < y of the rationals extends to the reals, again with similar 
features. We shall discuss this matter more deeply in the following Sect. 1.3.1. 
iii) Rational numbers are dense in the set of real numbers. This means there are
infinitely many rationals sitting between any two real numbers. It also implies
that each real number can be approximated by a rational number as well
as we please. If for example r = cₖcₖ₋₁···c₁c₀.d₁d₂···dᵢdᵢ₊₁··· has a non-periodic
infinite decimal expansion, we can approximate it by the rational
qᵢ = cₖcₖ₋₁···c₁c₀.d₁d₂···dᵢ obtained by ignoring all decimal digits past the
ith one; as i increases, the approximation of r will get better and better.
iv) The set of real numbers is complete. Geometrically speaking, this is equivalent 
to asking that each point on the line is associated to a unique real number, as 
already mentioned. Completeness guarantees for instance the existence of the 
square root of 2, i.e., the solvability in R of the equation x? = 2, as well as of 
infinitely many other equations, algebraic or not. We shall return to this point 
in Sect. 1.3.2. 
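
Property iii) can be illustrated on r = √2. The sketch below computes the truncated expansions with exact integer arithmetic; the name `truncated_sqrt2` and the use of `math.isqrt` (integer square root) are choices made here, not part of the text.

```python
from fractions import Fraction
from math import isqrt


def truncated_sqrt2(i: int) -> Fraction:
    """Rational obtained by keeping the first i decimal digits of sqrt(2)."""
    scale = 10 ** i
    # isqrt(2 * scale^2) is the integer part of sqrt(2) * 10^i
    return Fraction(isqrt(2 * scale * scale), scale)


for i in range(6):
    q = truncated_sqrt2(i)
    print(i, q, float(2 - q * q))   # the gap 2 - q^2 shrinks as i grows
```

Each approximation satisfies q² < 2 < (q + 10⁻ⁱ)², so it differs from √2 by less than 10⁻ⁱ.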


1.3.1 The ordering of real numbers 


Non-zero real numbers are either positive or negative. Positive reals form the
subset R₊, negative reals the subset R₋. We are thus in presence of a partition
R = R₋ ∪ {0} ∪ R₊. The set


R* = {0} ∪ R₊


of non-negative reals will also be needed. Positive numbers correspond to points
on the line lying at the right (with respect to the positive direction) of the origin.

Instead of x ∈ R₊, one simply writes x > 0 (‘x is bigger, or larger, than
0’); similarly, x ∈ R* will be expressed by x ≥ 0 (‘x is bigger than or equal to 0’).
Therefore an order relation is defined by


x < y ⟺ y − x > 0.


This is a total ordering, i.e., given any two distinct reals x and y, one (and only 
one) of the following holds: either x < y or y < x. From the geometrical point of 
view the relation x < y tells that the point with coordinate x is placed at the left 
of the point with coordinate y. Let us also define 


x ≤ y ⟺ x < y or x = y.


Clearly, x < y implies x ≤ y. For example the relations 3 ≤ 7 and 7 ≤ 7 are true,
whereas 3 ≤ 2 is not.


The order relation ≤ (or <) interacts with the algebraic operations of sum and
product as follows:


if x ≤ y and z is any real number, then x + z ≤ y + z


(adding the same real number to both sides of an inequality leaves the latter
unchanged);


if x ≤ y and z ≥ 0, then xz ≤ yz;    if x ≤ y and z < 0, then xz ≥ yz


(multiplying both sides of an inequality by a non-negative number does not alter it,
while if the number is negative it inverts the inequality). Example: multiplying
the inequality −3 ≤ 2 by −1 gives 3 ≥ −2, that is, −2 ≤ 3. The latter property
implies the well-known sign rule: the product of two numbers with alike signs is
positive, the product of two numbers of different sign is negative.


Absolute value. Let us introduce now a simple yet important notion. Given a
real number x, one calls absolute value of x the real number


|x| = x if x ≥ 0,    |x| = −x if x < 0.


Thus |x| ≥ 0 for any x in R. For instance |5| = 5, |0| = 0, |−5| = 5. Geometrically,
|x| represents the distance from the origin of the point with coordinate x; thus,
|x − y| = |y − x| is the distance between the two points of coordinates x and y.


The following relations, easy to prove, will be useful

|x + y| ≤ |x| + |y|,    for all x, y ∈ R    (1.1)

(called triangle inequality) and

|xy| = |x||y|,    for all x, y ∈ R.


Throughout the text we shall solve equations and inequalities involving
absolute values. Let us see the simplest ones. According to the definition,

|x| = 0

has the unique solution x = 0. If a is any number > 0, the equation

|x| = a

has two solutions x = a and x = −a, so

|x| = a ⟺ x = ±a.


In order to solve


|x| ≤ a,    where a ≥ 0,


consider first the solutions x ≥ 0, for which |x| = x, so that now the inequality
reads x ≤ a; then consider x < 0, in which case |x| = −x, and solve −x ≤ a, or
−a ≤ x. To summarise, the solutions are real numbers x satisfying 0 ≤ x ≤ a or
−a ≤ x < 0, which may be written in a shorter way as


|x| ≤ a ⟺ −a ≤ x ≤ a.    (1.2)


Similarly, it is easy to see that if b ≥ 0,

|x| ≥ b ⟺ x ≤ −b or x ≥ b.    (1.3)

The slightly more general inequality

|x − x₀| ≤ a,

where x₀ ∈ R is fixed and a ≥ 0, is equivalent to −a ≤ x − x₀ ≤ a; adding x₀ gives

|x − x₀| ≤ a ⟺ x₀ − a ≤ x ≤ x₀ + a.    (1.4)

In all examples we can replace the symbol ≤ by < (and ≥ by >) and the conclusions hold.
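
The equivalences (1.2), (1.3) and (1.4) can be spot-checked on a grid of rational sample points. The sketch below is only a numerical sanity check, not a proof; the function name `check_abs_rules` and the sample values are choices made here.

```python
from fractions import Fraction


def check_abs_rules(x0: Fraction, a: Fraction, b: Fraction, samples) -> bool:
    """Compare both sides of (1.2), (1.3), (1.4) at each sample point (a, b >= 0)."""
    for x in samples:
        if (abs(x) <= a) != (-a <= x <= a):                      # (1.2)
            return False
        if (abs(x) >= b) != (x <= -b or x >= b):                 # (1.3)
            return False
        if (abs(x - x0) <= a) != (x0 - a <= x <= x0 + a):        # (1.4)
            return False
    return True


# Multiples of 1/4 between -10 and 10, so the end-points are hit exactly
samples = [Fraction(k, 4) for k in range(-40, 41)]
print(check_abs_rules(Fraction(3, 2), Fraction(2), Fraction(5, 2), samples))
```

Exact rationals are used instead of floating-point numbers so that the boundary cases x = ±a, x = ±b and x = x₀ ± a are tested without rounding.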

Intervals. The previous discussion shows that Mathematical Analysis often deals
with subsets of R whose elements lie between two fixed numbers. They are called
intervals.


Definition 1.2 Let a and b be real numbers such that a ≤ b. The closed
interval with end-points a, b is the set

[a, b] = {x ∈ R | a ≤ x ≤ b}.

If a < b, one defines the open interval with end-points a, b as the set

(a, b) = {x ∈ R | a < x < b}.

An equivalent notation is ]a, b[.

If one includes only one end-point, then the interval with end-points a, b

[a, b) = {x ∈ R | a ≤ x < b}

is called half-open on the right, while

(a, b] = {x ∈ R | a < x ≤ b}

is half-open on the left.


Figure 1.6. Geometric representation of the closed interval [a, b] (left) and of the open
interval (a, b) (right)


Example 1.3
Describe the set A of elements x ∈ R such that
2 ≤ |x| < 5.
Because of (1.2) and (1.3), we easily have
A = (−5, −2] ∪ [2, 5).


Intervals defined by a single inequality are useful, too. Define 


[a, +∞) = {x ∈ R | a ≤ x},    (a, +∞) = {x ∈ R | a < x},


and

(−∞, b] = {x ∈ R | x ≤ b},    (−∞, b) = {x ∈ R | x < b}.


The symbols −∞ and +∞ do not indicate real numbers; they allow one to extend
the ordering of the reals with the convention that −∞ < x and x < +∞ for all
x ∈ R. Otherwise said, the condition a ≤ x is the same as a ≤ x < +∞, so the
notation [a, +∞) is consistent with the one used for real end-points. Sometimes it
is convenient to set

(−∞, +∞) = R.


In general one says that an interval I is closed if it contains its end-points, open 
if the end-points are not included. All points of an interval, apart from the end- 
points, are called interior points. 


Bounded sets. Let us now discuss the notion of boundedness of a set. 


Definition 1.4 A subset A of R is called bounded from above if there
exists a real number b such that


x ≤ b,    for all x ∈ A.


Any b with this property is called an upper bound of A.
The set A is bounded from below if there is a real number a with


x ≥ a,    for all x ∈ A.


Every a satisfying this relation is called a lower bound of A.
Finally, one calls A bounded if it is bounded from above and below.


In terms of intervals, a set is bounded from above if it is contained in an interval
of the sort (−∞, b] with b ∈ R, and bounded if it is contained in an interval [a, b]
for some a, b ∈ R. It is not difficult to show that A is bounded if and only if there
exists a real c > 0 such that


|x| ≤ c,    for all x ∈ A.


Examples 1.5 


i) The set N is bounded from below (each number a ≤ 0 is a lower bound), but
not from above: in fact, the so-called Archimedean property holds: for any
real b > 0, there exists a natural number n with


n > b.    (1.5)


ii) The interval (−∞, 1] is bounded from above, not from below. The interval
(−5, 12) is bounded.


iii) The set

A = {0, 1/2, 2/3, ..., n/(n+1), ...} = {n/(n+1) | n ∈ N}    (1.6)

is bounded, in fact 0 ≤ n/(n+1) < 1 for any n ∈ N.


iv) The set B = {x ∈ Q | x² < 2} is bounded. Taking x such that |x| > 3 for
example, then x² > 9 > 2, so x ∉ B. Thus B ⊂ [−3, 3].


Definition 1.6 A set A ⊂ R admits a maximum if an element x_M ∈ A
exists such that

x ≤ x_M,    for any x ∈ A.

The element x_M (necessarily unique) is the maximum of the set A and
one denotes it by x_M = max A.

The minimum of a set A, denoted by x_m = min A, is defined in a similar
way.


A set admitting a maximum must be bounded from above: the maximum is an
upper bound for the set, actually the smallest of all possible upper bounds, as we
shall prove. The opposite is not true: a set can be bounded from above but not
admit a maximum, like the set A of (1.6). We know already that 1 is an upper
bound for A. Among all upper bounds, 1 is privileged, being the smallest upper
bound. To convince ourselves of this fact, let us show that each real number r < 1
is not an upper bound, i.e., there is a natural number n such that

n/(n+1) > r.

Since 1/2 ∈ A, this is clear when r ≤ 0, so let 0 < r < 1. For n ≥ 1 the inequality
is equivalent to

(n+1)/n < 1/r,    hence    1 + 1/n < 1/r,    or    1/n < (1−r)/r.

This is to say n > r/(1−r), and the existence of such n follows from property (1.5).
So, 1 is the smallest upper bound of A, yet not the maximum, for 1 ∉ A: there is no
natural number n such that n/(n+1) = 1. One calls 1 the supremum, or least upper
bound, of A and writes 1 = sup A.
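
The argument is constructive: given r < 1 it produces an explicit element of A larger than r. The following is a minimal sketch for rational r; the function name `witness` is a choice made here.

```python
from fractions import Fraction
from math import floor


def witness(r: Fraction) -> int:
    """Return a natural number n with n/(n+1) > r, for a rational r < 1."""
    if r >= 1:
        raise ValueError("need r < 1")
    if r <= 0:
        return 1                      # 1/2 > r already
    # smallest integer strictly greater than r/(1 - r)
    return floor(r / (1 - r)) + 1


for r in (Fraction(1, 2), Fraction(9, 10), Fraction(999, 1000)):
    n = witness(r)
    print(r, n, Fraction(n, n + 1) > r)
```

The closer r is to 1, the larger the witness n becomes, yet one always exists, which is why no number below 1 can be an upper bound of A.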


Analogously, 2 is the smallest of upper bounds of the interval I = (0, 2), but
it does not belong to I. Thus 2 is the supremum, or least upper bound, of I,
2 = sup I.


Definition 1.7 Let A ⊂ R be bounded from above. The supremum or least
upper bound of A is the smallest of all upper bounds of A, denoted by sup A.


If A ⊂ R is bounded from below, one calls infimum or greatest lower
bound of A the largest of all lower bounds of A. This is denoted by inf A.


The number s = sup A is characterised by two conditions:


i) x ≤ s for all x ∈ A;
                                                                    (1.7)
ii) for any real r < s, there is an x ∈ A with x > r.


While i) tells that s is an upper bound for A, according to ii) each number smaller
than s is not an upper bound for A, rendering s the smallest among all upper
bounds.

The two conditions (1.7) must be fulfilled in order to show that a given number 
is the supremum of a set. That is precisely what we did to claim that 1 was the 
supremum of (1.6). 

The notion of supremum generalises that of maximum of a set. It is immediate 
to see that if a set admits a maximum, this maximum must be the supremum 
as well. 


If a set A is not bounded from above, one says that its supremum is +∞, i.e.,
one defines

sup A = +∞.


Similarly, inf A = −∞ for a set A not bounded from below.


1.3.2 Completeness of R 


The property of completeness of R may be formalised in several equivalent ways.
The reader should have already come across (Dedekind’s) separability axiom:
decomposing R into the union of two disjoint subsets C₁ and C₂ (the pair (C₁, C₂)
is called a cut) so that each element of C₁ is smaller than or equal to every element
in C₂, there exists a (unique) separating element s ∈ R:


x₁ ≤ s ≤ x₂,    ∀x₁ ∈ C₁, ∀x₂ ∈ C₂.


An alternative formulation of completeness involves the notion of supremum of
a set: every set bounded from above admits a supremum in R, i.e., among the
upper bounds of the set there is one that is smaller than or equal to all the others.


With the help of this property one can prove, for example, the existence in
R of the square root of 2, hence of a number p (> 0) such that p² = 2. Going
back to Example 1.5 iv), the completeness of the reals ensures that the bounded
set B = {x ∈ Q | x² < 2} has a supremum, say p. Using the properties of R it
is possible to show that p² < 2 cannot occur, otherwise p would not be an upper
bound for B, and neither p² > 2 holds, for p would not be the least of all upper
bounds. Thus necessarily p² = 2. Note that B, albeit contained in Q, does not
admit a rational supremum, because p² = 2 prevents p from being rational
(Property 1.1).
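
The supremum of B can be trapped between rationals by bisection. The sketch below is an illustration, not the proof alluded to above; the function name `bracket_sup_B` and the starting interval [1, 2] are choices made here.

```python
from fractions import Fraction


def bracket_sup_B(steps: int) -> tuple[Fraction, Fraction]:
    """Bisect [1, 2], keeping lo in B (lo^2 < 2) and hi an upper bound of B (hi^2 > 2)."""
    lo, hi = Fraction(1), Fraction(2)
    for _ in range(steps):
        mid = (lo + hi) / 2
        if mid * mid < 2:
            lo = mid        # mid belongs to B, so sup B >= mid
        else:
            hi = mid        # mid^2 > 2 (equality is impossible for a rational)
    return lo, hi


lo, hi = bracket_sup_B(20)
print(float(lo), float(hi), hi - lo)
```

After each step the supremum lies between `lo` and `hi`, and the width of the bracket halves; no rational ever closes the gap, in agreement with Property 1.1.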


This example explains why the completeness of R lies at the core of the
possibility to solve in R many remarkable equations. We are thinking in particular
about the family of algebraic equations


xⁿ = a,    (1.8)


where n ∈ N₊ and a ∈ R, for which it is worth recalling the following known fact.


Property 1.8 i) Let n ∈ N₊ be odd. Then for any a ∈ R equation (1.8) has
exactly one solution in R, denoted by x = ⁿ√a or x = a^(1/n) and called the nth
root of a.

ii) Let n ∈ N₊ be even. For any a > 0 equation (1.8) has two real solutions
with the same absolute value but opposite signs; when a = 0 there is one
solution x = 0 only; for a < 0 there are no solutions in R. The non-negative
solution is indicated by x = ⁿ√a or x = a^(1/n), and called the nth (arithmetic)
root of a.


1.4 Factorials and binomial coefficients 


We introduce now some noteworthy integers that play a role in many areas of 
Mathematics. 

Given a natural number n ≥ 1, the product of all natural numbers between
1 and n goes under the name of factorial of n and is indicated by n! (read ‘n
factorial’). For convenience one sets 0! = 1. Thus


n! = 1 · 2 · ... · n = (n − 1)! n    for n ≥ 1.    (1.9)


Factorials grow extremely rapidly as n increases; for instance 5! = 120, 10! =
3628800 and 100! > 10¹⁵⁷.


Example 1.9 


Suppose we have n ≥ 2 balls of different colours in a box. In how many ways
can we extract the balls from the box?

When taking the first ball we are making a choice among the n balls in the box;
the second ball will be chosen among the n − 1 balls left, the third one among
n − 2 and so on. Altogether we have n(n − 1) · ... · 2 · 1 = n! different ways to
extract the balls: n! represents the number of arrangements of n distinct objects
in a sequence, called permutations of n ordered objects.

If we stop after k extractions, 0 < k < n, we end up with n(n − 1)...(n − k + 1)
possible outcomes. The latter expression, also written as n!/(n − k)!, is the number
of possible permutations of n distinct objects in sequences of k objects.
If we allow repeated colours, for instance by reintroducing in the box a ball of
the same colour as the one just extracted, each time we choose among n. After
k > 0 choices there are then nᵏ possible sequences of colours: nᵏ is the number
of permutations of n objects in sequences of k, with repetitions (i.e.,
allowing an object to be chosen more than once).
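
For a small number of balls the three counts can be confirmed by listing the outcomes explicitly. The sketch below uses the standard `itertools` module; the four colours and the value k = 2 are arbitrary choices made here.

```python
from itertools import permutations, product
from math import factorial

balls = ["red", "green", "blue", "yellow"]   # n = 4 distinct colours
n, k = len(balls), 2

orderings = list(permutations(balls))             # all n balls, in every order
partial = list(permutations(balls, k))            # stop after k extractions
with_repetition = list(product(balls, repeat=k))  # ball put back after each draw

print(len(orderings), factorial(n))
print(len(partial), factorial(n) // factorial(n - k))
print(len(with_repetition), n ** k)
```

Each printed pair puts the size of the enumerated list next to the value given by the corresponding formula.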


Given two natural numbers n and k such that 0 ≤ k ≤ n, one calls binomial
coefficient the number

\binom{n}{k} = \frac{n!}{k!\,(n-k)!}    (1.10)

(the symbol \binom{n}{k} is usually read ‘n choose k’). Notice that if 0 < k < n

n! = 1 · ... · n = 1 · ... · (n−k)(n−k+1) · ... · (n−1)n = (n−k)! (n−k+1) · ... · (n−1)n,

so simplifying and rearranging the order of factors at the numerator, (1.10) becomes

\binom{n}{k} = \frac{n(n-1)\cdots(n-k+1)}{k!},    (1.11)

another common expression for the binomial coefficient. From definition (1.10) it
follows directly that

\binom{n}{k} = \binom{n}{n-k}

and

\binom{n}{0} = \binom{n}{n} = 1,    \binom{n}{1} = \binom{n}{n-1} = n.


Moreover, it is easy to prove that for any n ≥ 1 and any k with 0 < k < n

\binom{n}{k} = \binom{n-1}{k-1} + \binom{n-1}{k},    (1.12)

which provides a convenient means for computing binomial coefficients recursively;
the coefficients relative to n objects are easily determined once those involving
n − 1 objects are computed. The same formula suggests writing down binomial
coefficients in a triangular pattern, known as Pascal’s triangle¹ (Fig. 1.7): each
coefficient of a given row, except for the 1’s on the boundary, is the sum of the two
numbers that lie above it in the preceding row, precisely as (1.12) prescribes. The
construction of Pascal’s triangle shows that the binomial coefficients are natural
numbers.


Figure 1.7. Pascal’s triangle 
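
The recursive construction of the triangle takes only a few lines. The sketch below builds the rows from (1.12) and compares them with the values given by `math.comb`, which evaluates the same coefficients directly; the function name `pascal_rows` is a choice made here.

```python
from math import comb


def pascal_rows(n_max: int) -> list[list[int]]:
    """Rows 0, ..., n_max of Pascal's triangle, each built from the previous one."""
    rows = [[1]]
    for n in range(1, n_max + 1):
        prev = rows[-1]
        # interior entries: sum of the two numbers lying above, as in (1.12)
        inner = [prev[k - 1] + prev[k] for k in range(1, n)]
        rows.append([1] + inner + [1])
    return rows


for row in pascal_rows(6):
    print(row)
```

Only additions of natural numbers are used, which is the reason why every entry of the triangle is a natural number.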


The term ‘binomial coefficient’ originates from the power expansion of the
polynomial a + b in terms of powers of a and b. The reader will remember the
important identities


(a + b)² = a² + 2ab + b²    and    (a + b)³ = a³ + 3a²b + 3ab² + b³.


The coefficients showing up are precisely the binomial coefficients for n = 2 and
n = 3. In general, for any n ≥ 0, the formula

(a+b)^n = \sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^k    (1.13)

holds, known as (Newton’s) binomial expansion. This formula is proven with
(1.12) using a proof by induction (see Appendix A.1, p. 428).
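
The binomial expansion can be tested on exact rational values of a and b. The sketch below is a numerical check under stated assumptions, not a substitute for the proof by induction; the function name `binomial_sum` and the sample values are choices made here.

```python
from fractions import Fraction
from math import comb


def binomial_sum(a, b, n: int):
    """Sum over k = 0, ..., n of (n choose k) * a^(n-k) * b^k."""
    return sum(comb(n, k) * a ** (n - k) * b ** k for k in range(n + 1))


a, b = Fraction(2, 3), Fraction(-5, 7)
for n in range(7):
    print(n, binomial_sum(a, b, n) == (a + b) ** n)
```

Using `Fraction` keeps every term exact, so the comparison with (a + b)ⁿ is not affected by rounding.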


Example 1.9 (continuation) 


Given n balls of different colours, let us fix k with 0 ≤ k ≤ n. How many different
sets of k balls can we form?

Extracting one ball at a time for k times, we already know that there are
n(n − 1)...(n − k + 1) outcomes. On the other hand the same k balls, extracted
in a different order, will yield the same set. Since the possible orderings of k
elements are k!, we see that the number of distinct sets of k balls chosen from n is

\frac{n(n-1)\cdots(n-k+1)}{k!} = \binom{n}{k}.

This coefficient represents the number of
combinations of n objects taken k at a time. Equivalently, it is the number of
subsets of k elements of a set of cardinality n.


¹ Sometimes the denomination Tartaglia’s triangle appears.


Formula (1.13) with a = b = 1 shows that the sum of all binomial coefficients
with n fixed equals 2ⁿ, not coincidentally also the total number of subsets of a set
with n elements.
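
Both counts can be observed by enumerating subsets of a small set. The sketch below relies on `itertools.combinations`, which lists each k-element subset exactly once; the five colours are an arbitrary choice made here.

```python
from itertools import combinations
from math import comb

colours = ["red", "green", "blue", "yellow", "white"]
n = len(colours)

# Number of k-element subsets, for k = 0, 1, ..., n
subsets_by_size = [len(list(combinations(colours, k))) for k in range(n + 1)]

print(subsets_by_size)
print(sum(subsets_by_size), 2 ** n)
```

The list reproduces row n = 5 of Pascal's triangle, and its entries add up to the total number of subsets.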


1.5 Cartesian product 


Let X, Y be non-empty sets. Given elements x in X and y in Y, we construct the 
ordered pair of numbers 


(x,y), 


whose first component is x and second component is y. An ordered pair is
conceptually other than a set of two elements. As the name says, in an ordered pair the
order of the components is paramount. This is not the case for a set. If x ≠ y the
ordered pairs (x, y) and (y, x) are distinct, while {x, y} and {y, x} coincide as sets.

The set of all ordered pairs (x, y) when x varies in X and y varies in Y is the
Cartesian product of X and Y, which is indicated by X × Y. Mathematically,


X × Y = {(x, y) | x ∈ X, y ∈ Y}.


The Cartesian product is represented using a rectangle, whose base corresponds
to the set X and whose height is Y (as in Fig. 1.8).


If the sets X, Y are different, the product X × Y will not be equal to Y × X,
in other words the Cartesian product is not commutative.
But if Y = X, it is customary to put X × X = X² for brevity. In this case the
subset of X²

Δ = {(x, y) ∈ X² | x = y}


of pairs with equal components is called the diagonal of the Cartesian product.
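
For finite sets these notions can be tried out directly, with tuples in the role of ordered pairs. In the sketch below the sets X and Y are arbitrary choices made here.

```python
from itertools import product

X = {1, 2, 3}
Y = {"a", "b"}

XxY = set(product(X, Y))   # all ordered pairs (x, y) with x in X, y in Y
YxX = set(product(Y, X))   # all ordered pairs (y, x)

# Diagonal of X x X: pairs with equal components
diagonal = {(x, y) for (x, y) in product(X, repeat=2) if x == y}

print(len(XxY), XxY == YxX)
print((1, 2) == (2, 1), {1, 2} == {2, 1})
print(sorted(diagonal))
```

The second line of output contrasts ordered pairs, where swapping the components gives a different object, with sets, where the order is irrelevant.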


Figure 1.8. Cartesian product of sets 


The most significant example of Cartesian product stems from X = Y = R. The
set R² consists of ordered pairs of real numbers. Just as the set R mathematically
represents a straight line, so R² is a model of the plane (Fig. 1.9, left). In order
to define this correspondence, choose a straight line in the plane and fix on it an
origin O, a positive direction and a length unit. This shall be the x-axis. Rotating
this line counter-clockwise around the origin by 90° generates the y-axis. In this
way we have now an orthonormal frame (we only mention that it is sometimes
useful to consider frames whose axes are not orthogonal, and/or the units on the
axes are different).

Given any point P on the plane, let us draw the straight lines parallel to the
axes passing through the point. Denote by x the real number corresponding to the
intersection of the x-axis with the parallel to the y-axis, and by y the real number
corresponding to the intersection of the y-axis with the parallel to the x-axis. An
ordered pair (x, y) ∈ R² is thus associated to each point P on the plane, and vice
versa. The components of the pair are called (Cartesian) coordinates of P in the
chosen frame.


The notion of Cartesian product can be generalised to the product of more
sets. Given n non-empty sets X₁, X₂,..., Xₙ, one considers ordered n-tuples


(x₁, x₂,..., xₙ)


where, for every i = 1, 2,..., n, each component xᵢ lives in the set Xᵢ. The
Cartesian product X₁ × X₂ × ... × Xₙ is then the set of all such n-tuples.

When X₁ = X₂ = ... = Xₙ = X one simply writes X × X × ... × X = Xⁿ.
In particular, R³ is the set of triples (x, y, z) of real numbers, and represents a
mathematical model of three-dimensional space (Fig. 1.9, right).


Figure 1.9. Models of the plane (left) and of space (right)


1.6 Relations in the plane 


We call Cartesian plane a plane equipped with an orthonormal Cartesian frame
built as above, which we saw can be identified with the product R².

Every non-empty subset R of R² defines a relation between real numbers;
precisely, one says x is R-related to y, or x is related to y by R, if the ordered
pair (x, y) belongs to R. The graph of the relation is the set of points in the plane
whose coordinates belong to R.

A relation is commonly defined by one or more (in)equalities involving the 
variables x and y. The subset R is then defined as the set of pairs (x, y) such that 
x and y satisfy the constraints. Finding R often means determining its graph in 
the plane. Let us see some examples. 


Examples 1.10
i) An equation like
ax + by = c,
with a, b constant and not both vanishing, defines a straight line. If b = 0, the line
is parallel to the y-axis, whereas a = 0 yields a parallel to the x-axis. Assuming
b ≠ 0 we can write the equation as
y = mx + q,

where m = −a/b and q = c/b. The number m is called slope of the line. The line
can be plotted by finding the coordinates of two points that belong to it, hence
two distinct pairs (x, y) solving the equation. In particular c = 0 (or q = 0) if
and only if the origin belongs to the line. The equation x − y = 0 for example
defines the bisectrix of the first and third quadrants of the plane.


ii) Replacing the ‘=’ sign by ‘<’ above, consider the inequality
\[ ax + by < c. \]
It defines one of the half-planes into which the straight line of equation $ax + by = c$
divides the plane (Fig. 1.10). If $b > 0$ for instance, the half-plane below the line
is obtained. This set is open, i.e., it does not contain the straight line, since the
inequality is strict. The inequality $ax + by \le c$ defines instead a closed set, i.e.,
including the line.


Figure 1.10. Graph of the relation of Example 1.10 ii) 




iii) The system
\[ \begin{cases} y > 0, \\ x - y \ge 0, \end{cases} \]
defines the intersection between the open half-plane above the $x$-axis and the
closed half-plane lying below the bisectrix of the first and third quadrants. Thus
the system describes (Fig. 1.11, left) the wedge between the positive $x$-axis and
the bisectrix (the points on the $x$-axis are excluded).
iv) The inequality
\[ |x - y| < 2 \]
is equivalent, recall (1.2), to
\[ -2 < x - y < 2. \]
The inequality on the left is in turn equivalent to $y < x + 2$, so it defines the open
half-plane below the line $y = x + 2$; similarly, the inequality on the right is the
same as $y > x - 2$ and defines the open half-plane above the line $y = x - 2$. What
we get is therefore the strip between the two lines, these excluded (Fig. 1.11,
right).
v) By Pythagoras’s Theorem, the equation
\[ x^2 + y^2 = 1 \]
defines the set of points $P$ in the plane with distance 1 from the origin of the
axes, that is, the circle centred at the origin with radius 1 (in trigonometry it
goes under the name of unit circle). The inequality
\[ x^2 + y^2 \le 1 \]
then defines the disc bounded by the unit circle (Fig. 1.12, left).


vi) The equation
\[ y = x^2 \]
yields the parabola with vertical axis, vertex at the origin and passing through
the point $P$ of coordinates $(1, 1)$.




Figure 1.11. Graphs of the relations of Examples 1.10 iii) (left) and 1.10 iv) (right) 




Figure 1.12. Graphs of the relations in Examples 1.10 v) (left) and 1.10 vi) (right) 


Thus the inequalities
\[ x^2 \le y \le 1 \]
define the region enclosed by the parabola and by the straight line given by $y = 1$
(Fig. 1.12, right).
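
A relation given by (in)equalities can be read as a yes/no test on pairs $(x, y)$. The following minimal Python sketch encodes three of the relations of Examples 1.10 as predicates; the function names and the sample coefficients $a = b = c = 1$ are illustrative choices, not notation from the text.

```python
# Each relation is modelled as a predicate on a pair (x, y).
# Names and default coefficients are illustrative assumptions.

def half_plane(x, y, a=1.0, b=1.0, c=1.0):
    """Open half-plane ax + by < c (Example ii)."""
    return a * x + b * y < c

def wedge(x, y):
    """System y > 0, x - y >= 0 (Example iii)."""
    return y > 0 and x - y >= 0

def strip(x, y):
    """Open strip |x - y| < 2 (Example iv)."""
    return abs(x - y) < 2

if __name__ == "__main__":
    for point in [(0, 0), (1, 1), (2, 1), (3, 0)]:
        hits = [r.__name__ for r in (half_plane, wedge, strip) if r(*point)]
        print(point, hits)
```

Points on the boundary behave as the text predicts: $(1, 0)$ lies on the line $x + y = 1$ and on the $x$-axis, so it fails both strict conditions, whereas $(1, 1)$ lies on the bisectrix and belongs to the wedge.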


1.7 Exercises 


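1. Solve the following inequalities:

a) $\dfrac{2x-1}{x-3} > 0$
b) $\dfrac{1-7x}{3x+5} > 0$
c) $\dfrac{x-1}{x-2} > \dfrac{2x-3}{x-3}$
d) $\dfrac{|x|}{x-1} > \dfrac{x+1}{2x-1}$
e) $\dfrac{2x+3}{x+5} \le \dfrac{x+1}{|x-1|}$
f) $\sqrt{x^2-6x} > x+2$
g) $x - 3 \le \sqrt{x^2-2x}$
h) $\dfrac{x+3}{(x+1)^2\sqrt{x^2-3}} \ge 0$
i) $\sqrt{|x^2-4|} - x \ge 0$
l) $\dfrac{x\sqrt{|x^2-4|}}{x^2-4} - 1 > 0$


2. Describe the following subsets of $\mathbb{R}$:

a) $A = \{x \in \mathbb{R} : x^2 + 4x + 13 < 0\} \cap \{x \in \mathbb{R} : 3x^2 + 5 > 0\}$
b) $B = \{x \in \mathbb{R} : (x+2)(x-1)(x-5) < 0\} \cap \left\{x \in \mathbb{R} : \dfrac{3x+1}{x-2} \ge 0\right\}$
c) $C = \left\{x \in \mathbb{R} : \dfrac{x^2-5x+4}{x^2-9} < 0\right\} \cup \{x \in \mathbb{R} : \sqrt{7x+1} + x = 17\}$
d) $D = \{x \in \mathbb{R} : x - 4 \ge \sqrt{x^2-6x+5}\} \cup \{x \in \mathbb{R} : x + 2 > \sqrt{x-1}\}$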




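3. Determine and draw a picture of the following subsets of $\mathbb{R}^2$:

a) $A = \{(x,y) \in \mathbb{R}^2 : xy \ge 0\}$
b) $B = \{(x,y) \in \mathbb{R}^2 : x^2 - y^2 > 0\}$
c) $C = \{(x,y) \in \mathbb{R}^2 : |y - x^2| < 1\}$
d) $D = \{(x,y) \in \mathbb{R}^2 : x^2 + \frac{y^2}{4} \ge 1\}$
e) $E = \{(x,y) \in \mathbb{R}^2 : 1 + xy > 0\}$
f) $F = \{(x,y) \in \mathbb{R}^2 : x - y \neq 0\}$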


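4. Tell whether the following subsets of $\mathbb{R}$ are bounded from above and/or below,
specifying upper and lower bounds, plus maximum and minimum (if existent):

a) $A = \{x \in \mathbb{R} : x = n \text{ or } x = \frac{1}{n^2},\ n \in \mathbb{N} \setminus \{0\}\}$
b) $B = \{x \in \mathbb{R} : -1 < x \le 1 \text{ or } x = 20\}$
c) $C = \{x \in \mathbb{R} : 0 \le x < 1 \text{ or } x = \frac{2n-3}{n-1},\ n \in \mathbb{N} \setminus \{0, 1\}\}$
d) $D = \{z \in \mathbb{R} : z = xy \text{ with } x, y \in \mathbb{R},\ -1 \le x \le 2,\ -3 \le y < -1\}$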


1.7.1 Solutions 


1. Inequalities: 


a) This is a fractional inequality. A fraction is positive if and only if numerator
and denominator have the same sign. As $N(x) = 2x - 1 > 0$ if $x > 1/2$, and
$D(x) = x - 3 > 0$ for $x > 3$, the inequality holds when $x < 1/2$ or $x > 3$.

b) $-\frac{5}{3} < x < \frac{1}{7}$.

c) Shift all terms to the left and simplify:
\[ \frac{x-1}{x-2} - \frac{2x-3}{x-3} > 0, \quad \text{i.e.,} \quad \frac{-x^2 + 3x - 3}{(x-2)(x-3)} > 0. \]
The roots of the numerator are not real, so $N(x) < 0$ always. The inequality
thus holds when $D(x) < 0$, hence $2 < x < 3$.


d) Moving terms to one side and simplifying yields:
\[ \frac{|x|}{x-1} - \frac{x+1}{2x-1} > 0, \quad \text{i.e.,} \quad \frac{|x|(2x-1) - x^2 + 1}{(x-1)(2x-1)} > 0. \]
Since $|x| = x$ for $x \ge 0$ and $|x| = -x$ for $x < 0$, we study the two cases
separately.
When $x \ge 0$ the inequality reads
\[ \frac{2x^2 - x - x^2 + 1}{(x-1)(2x-1)} > 0, \quad \text{or} \quad \frac{x^2 - x + 1}{(x-1)(2x-1)} > 0. \]
The numerator has no real roots, hence $x^2 - x + 1 > 0$ for all $x$. Therefore
the inequality is satisfied if the denominator is positive. Taking the constraint
$x \ge 0$ into account, this means $0 \le x < 1/2$ or $x > 1$.
When $x < 0$ we have
\[ \frac{-2x^2 + x - x^2 + 1}{(x-1)(2x-1)} > 0, \quad \text{i.e.,} \quad \frac{-3x^2 + x + 1}{(x-1)(2x-1)} > 0. \]
$N(x)$ is annihilated by $x_1 = \frac{1-\sqrt{13}}{6}$ and $x_2 = \frac{1+\sqrt{13}}{6}$, so $N(x) > 0$ for $x_1 < x < x_2$
(notice that $x_1 < 0$ and $x_2 \in (\frac{1}{2}, 1)$). As above the denominator is positive
when $x < 1/2$ or $x > 1$. Keeping $x < 0$ in mind, we have $x_1 < x < 0$.

The initial inequality is therefore satisfied by any $x \in (x_1, \frac{1}{2}) \cup (1, +\infty)$.
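
A case analysis of this kind is easy to cross-check numerically. The sketch below (an illustration, not part of the original solution) samples the real line on a grid that avoids the critical points and compares the inequality $\frac{|x|}{x-1} > \frac{x+1}{2x-1}$ with membership in the claimed solution set; the grid step and the helper names are arbitrary assumptions.

```python
import math

def satisfies(x):
    """True when |x|/(x - 1) > (x + 1)/(2x - 1); False where undefined."""
    if x == 1 or x == 0.5:
        return False
    return abs(x) / (x - 1) > (x + 1) / (2 * x - 1)

# Negative root of -3x^2 + x + 1, as found in the solution.
X1 = (1 - math.sqrt(13)) / 6

def in_claimed_set(x):
    """Membership in (x1, 1/2) U (1, +infinity)."""
    return (X1 < x < 0.5) or (x > 1)

# Grid on [-3, 5.2] whose points do not hit x1, 0, 1/2 or 1 exactly.
samples = [-3 + 0.0137 * k for k in range(600)]
mismatches = [x for x in samples if satisfies(x) != in_claimed_set(x)]
print(len(mismatches))
```

Agreement on a finite grid does not prove the result, of course; it only makes a sign error in one of the cases unlikely.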

e) $-5 < x \le -2$, $-\frac{1}{3} \le x < 1$, $1 < x \le \frac{5+\sqrt{57}}{2}$.    f) $x < -\frac{2}{5}$.

g) First of all observe that the right-hand side is always $\ge 0$ where defined, hence
when $x^2 - 2x \ge 0$, i.e., $x \le 0$ or $x \ge 2$. The inequality is certainly true if the
left-hand side $x - 3$ is $\le 0$, so for $x \le 3$.

If $x - 3 > 0$, we take squares to obtain
\[ x^2 - 6x + 9 \le x^2 - 2x, \quad \text{i.e.,} \quad 4x \ge 9, \quad \text{whence} \quad x \ge \frac{9}{4}. \]
Gathering all information we conclude that the starting inequality holds
wherever it is defined, that is for $x \le 0$ and $x \ge 2$.

h) $x \in [-3, -\sqrt{3}) \cup (\sqrt{3}, +\infty)$.

i) As $|x^2 - 4| \ge 0$, $\sqrt{|x^2 - 4|}$ is well defined. Let us write the inequality in the
form
\[ \sqrt{|x^2 - 4|} \ge x. \]
If $x \le 0$ the inequality is always true, for the left-hand side is non-negative. If $x > 0$
we square:
\[ |x^2 - 4| \ge x^2. \]
Note that
\[ |x^2 - 4| = \begin{cases} x^2 - 4 & \text{if } x \le -2 \text{ or } x \ge 2, \\ -x^2 + 4 & \text{if } -2 < x < 2. \end{cases} \]
Consider the case $x \ge 2$ first; the inequality becomes $x^2 - 4 \ge x^2$, which is
never true.
Let now $0 < x < 2$; then $-x^2 + 4 \ge x^2$, hence $x^2 - 2 \le 0$. Thus $0 < x \le \sqrt{2}$
must hold.
In conclusion, the inequality holds for $x \le \sqrt{2}$.

l) $x \in (-2, -\sqrt{2}) \cup (2, +\infty)$.


2. Subsets of R: 


a) Because $x^2 + 4x + 13 = 0$ cannot be solved over the reals, the condition
$x^2 + 4x + 13 < 0$ is never satisfied and the first set is empty. On the other
hand, $3x^2 + 5 > 0$ holds for every $x \in \mathbb{R}$, therefore the second set is the whole
$\mathbb{R}$. Thus $A = \emptyset \cap \mathbb{R} = \emptyset$.


b) $B = (-\infty, -2) \cup (2, 5)$.

c) We can write
\[ \frac{x^2 - 5x + 4}{x^2 - 9} = \frac{(x-4)(x-1)}{(x-3)(x+3)}, \]
whence the first set is $(-3, 1) \cup (3, 4)$.

To find the second set, let us solve the irrational equation $\sqrt{7x+1} + x = 17$,
which we write as $\sqrt{7x+1} = 17 - x$. The radicand must necessarily be non-negative,
hence $x \ge -\frac{1}{7}$. Moreover, a square root is always $\ge 0$, so we must impose
$17 - x \ge 0$, i.e., $x \le 17$. Thus for $-\frac{1}{7} \le x \le 17$, squaring yields
\[ 7x + 1 = (17 - x)^2, \quad \text{i.e.,} \quad x^2 - 41x + 288 = 0. \]
The latter equation has two solutions $x_1 = 9$, $x_2 = 32$ (which fails the
constraint $x \le 17$, and as such cannot be considered). The second set then contains
only $x = 9$.

Therefore $C = (-3, 1) \cup (3, 4) \cup \{9\}$.
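
Squaring an irrational equation can introduce spurious roots, which is why $x_2 = 32$ had to be discarded. A short Python check (a sketch with illustrative variable names) substitutes both roots of the squared equation back into the original one:

```python
import math

def lhs(x):
    """Left-hand side of the original equation sqrt(7x + 1) + x = 17."""
    return math.sqrt(7 * x + 1) + x

# Roots of x^2 - 41x + 288 = 0 via the quadratic formula.
disc = 41 ** 2 - 4 * 288
candidates = [(41 - math.sqrt(disc)) / 2, (41 + math.sqrt(disc)) / 2]

# Keep only the candidates that satisfy the equation before squaring.
solutions = [x for x in candidates if math.isclose(lhs(x), 17)]
print(candidates, solutions)
```

Only $x = 9$ survives: for $x = 32$ the left-hand side equals $15 + 32 = 47$.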


d) $D = [1, +\infty)$.


3. Subsets of $\mathbb{R}^2$:

a) The condition holds if $x$ and $y$ have equal signs, thus in the first and third
quadrants including the axes (Fig. 1.13, left).

b) See Fig. 1.13, right.


Figure 1.13. The sets A and B of Exercise 3


c) We have
\[ |y - x^2| = \begin{cases} y - x^2 & \text{if } y \ge x^2, \\ x^2 - y & \text{if } y \le x^2. \end{cases} \]
Demanding $y \ge x^2$ means looking at the region in the plane bounded from
below by the parabola $y = x^2$. There, we must have
\[ y - x^2 < 1, \quad \text{i.e.,} \quad y < x^2 + 1, \]
that is $x^2 \le y < x^2 + 1$.
Vice versa if $y < x^2$,
\[ x^2 - y < 1, \quad \text{i.e.,} \quad y > x^2 - 1, \]
hence $x^2 - 1 < y \le x^2$.

Eventually, the required region is confined by (though does not include) the
parabolas $y = x^2 - 1$ and $y = x^2 + 1$ (Fig. 1.14, left).

d) See Fig. 1.14, right.


Figure 1.14. The sets C and D of Exercise 3


e) For $x > 0$ the condition $1 + xy > 0$ is the same as $y > -\frac{1}{x}$. Thus we consider
all points of the first and fourth quadrants above the hyperbola $y = -\frac{1}{x}$.
For $x < 0$, $1 + xy > 0$ means $y < -\frac{1}{x}$, satisfied by the points in the second and
third quadrants this time, lying below the hyperbola $y = -\frac{1}{x}$.
At last, if $x = 0$, $1 + xy > 0$ holds for any $y$, implying that the $y$-axis belongs
to the set $E$.

Therefore: the region lies between the two branches of the hyperbola $y = -\frac{1}{x}$ (these
are not part of $E$), including the $y$-axis (Fig. 1.15, left).

f) See Fig. 1.15, right.


Figure 1.15. The sets & and F of Exercise 3 


4. Bounded and unbounded sets:

a) We have $A = \{1, 2, 3, \ldots, \frac{1}{4}, \frac{1}{9}, \frac{1}{16}, \ldots\}$. Since $\mathbb{N} \setminus \{0\} \subset A$, the set $A$ is not
bounded from above, hence $\sup A = +\infty$ and there is no maximum.

In addition, the fact that every element of $A$ is positive makes $A$ bounded from
below. We claim that 0 is the greatest lower bound of $A$. In fact, if $r > 0$ were
a lower bound of $A$, then $\frac{1}{n^2} \ge r$ for any non-zero $n \in \mathbb{N}$. This is the same as
$n^2 \le \frac{1}{r}$, hence $n \le \frac{1}{\sqrt{r}}$. But the last inequality is absurd since natural numbers
are not bounded from above. Finally $0 \notin A$, so we conclude $\inf A = 0$ and $A$
has no minimum.

b) $\inf B = -1$, $\sup B = \max B = 20$, and $\min B$ does not exist.

c) $C = [0, 1) \cup \{1, \frac{3}{2}, \frac{5}{3}, \frac{7}{4}, \ldots\} \subset [0, 2)$; then $C$ is bounded, and $\inf C = \min C = 0$.
Since $\frac{2n-3}{n-1} = 2 - \frac{1}{n-1}$, it is not hard to show that $\sup C = 2$, although
there is no maximum in $C$.

d) $\inf D = \min D = -6$, $\sup D = \max D = 3$.


2 


Functions 


Functions crop up regularly in everyday life (for instance: each student of the 
Polytechnic of Turin has a unique identification number), in physics (to each point 
of a region in space occupied by a fluid we may associate the velocity of the particle 
passing through that point at a given moment), in economy (each working day at 
Milan’s stock exchange is tagged with the Mibtel index), and so on. 

The mathematical notion of a function subsumes all these situations. 


2.1 Definitions and first examples 


Let $X$ and $Y$ be two sets. A function $f$ defined on $X$ with values in $Y$ is
a correspondence associating to each element $x \in X$ at most one element $y \in Y$.
This is often shortened to ‘a function from $X$ to $Y$’. A synonym for function is
map. The set of $x \in X$ to which $f$ associates an element in $Y$ is the domain of
$f$; the domain is a subset of $X$, indicated by $\operatorname{dom} f$. One writes
\[ f : \operatorname{dom} f \subseteq X \to Y. \]
If $\operatorname{dom} f = X$, one says that $f$ is defined on $X$ and writes simply $f : X \to Y$.
The element $y \in Y$ associated to an element $x \in \operatorname{dom} f$ is called the image of
$x$ by or under $f$ and denoted $y = f(x)$. Sometimes one writes
\[ f : x \mapsto f(x). \]
The set of images $y = f(x)$ of all points in the domain constitutes the range of
$f$, a subset of $Y$ indicated by $\operatorname{im} f$.

The graph of $f$ is the subset $\Gamma(f)$ of the Cartesian product $X \times Y$ made of
pairs $(x, f(x))$ when $x$ varies in the domain of $f$, i.e.,
\[ \Gamma(f) = \{(x, f(x)) \in X \times Y : x \in \operatorname{dom} f\}. \tag{2.1} \]


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_2, 
© Springer International Publishing Switzerland 2015 




Figure 2.1. Naive representation of a function using Venn diagrams 


In the sequel we shall consider maps between sets of numbers most of the time.
If $Y = \mathbb{R}$, the function $f$ is said to be real or real-valued. If $X = \mathbb{R}$, the function is
of one real variable. Therefore the graph of a real function of one real variable is a subset of the
Cartesian plane $\mathbb{R}^2$.


A remarkable special case of map arises when $X = \mathbb{N}$ and the domain contains
a set of the type $\{n \in \mathbb{N} : n \ge n_0\}$ for a certain natural number $n_0 \ge 0$. Such a
function is called a sequence. Usually, indicating by $a$ the sequence, it is preferable
to denote the image of the natural number $n$ by the symbol $a_n$ rather than $a(n)$;
thus we shall write $a : n \mapsto a_n$. A common way to denote sequences is $\{a_n\}_{n \ge n_0}$
(ignoring possible terms with $n < n_0$) or even $\{a_n\}$.


Examples 2.1 


Let us consider examples of real functions of real variable. 


i) $f : \mathbb{R} \to \mathbb{R}$, $f(x) = ax + b$ ($a$, $b$ real coefficients), whose graph is a straight line
(Fig. 2.2, top left).

ii) $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^2$, whose graph is a parabola (Fig. 2.2, top right).

iii) $f : \mathbb{R} \setminus \{0\} \subset \mathbb{R} \to \mathbb{R}$, $f(x) = \frac{1}{x}$, has a rectangular hyperbola in the coordinate
system of its asymptotes as graph (Fig. 2.2, bottom left).

iv) A real function of a real variable can be defined by multiple expressions on
different intervals, in which case it is called a piecewise function. An example
is given by $f : [0, 3] \to \mathbb{R}$
\[ f(x) = \begin{cases} 3x & \text{if } 0 \le x \le 1, \\ 4 - x & \text{if } 1 < x \le 2, \\ x - 1 & \text{if } 2 < x \le 3, \end{cases} \tag{2.2} \]
drawn in Fig. 2.2, bottom right.


Figure 2.2. Graphs of the maps $f(x) = 2x - 2$ (top left), $f(x) = x^2$ (top right), $f(x) = \frac{1}{x}$
(bottom left) and of the piecewise function (2.2) (bottom right)


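Among piecewise functions, the following are particularly important:


v) the absolute value (Fig. 2.3, top left)
\[ f : \mathbb{R} \to \mathbb{R}, \qquad f(x) = |x| = \begin{cases} x & \text{if } x \ge 0, \\ -x & \text{if } x < 0; \end{cases} \]

vi) the sign (Fig. 2.3, top right)
\[ f : \mathbb{R} \to \mathbb{Z}, \qquad f(x) = \operatorname{sign}(x) = \begin{cases} +1 & \text{if } x > 0, \\ 0 & \text{if } x = 0, \\ -1 & \text{if } x < 0; \end{cases} \]

vii) the integer part (Fig. 2.3, bottom left), also known as floor function,
\[ f : \mathbb{R} \to \mathbb{Z}, \qquad f(x) = [x] = \text{the greatest integer} \le x; \]

viii) the mantissa (Fig. 2.3, bottom right)
\[ f : \mathbb{R} \to \mathbb{R}, \qquad f(x) = M(x) = x - [x] \]
(the property of the floor function implies $0 \le M(x) < 1$).


Figure 2.3. Clockwise from top left: graphs of the functions: absolute value, sign, mantissa
and integer part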


Let us give some examples of sequences now. 


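ix) The sequence
\[ a_n = \frac{n}{n+1} \tag{2.3} \]
is defined for all $n \ge 0$. The first few terms read
\[ a_0 = 0, \quad a_1 = \frac{1}{2} = 0.5, \quad a_2 = \frac{2}{3} = 0.\overline{6}, \quad a_3 = \frac{3}{4} = 0.75. \]
Its graph is shown in Fig. 2.4 (top left).

x) The sequence
\[ a_n = \left(1 + \frac{1}{n}\right)^n \tag{2.4} \]
is defined for $n \ge 1$. The first terms are
\[ a_1 = 2, \quad a_2 = \frac{9}{4} = 2.25, \quad a_3 = \frac{64}{27} = 2.\overline{370}, \quad a_4 = \frac{625}{256} = 2.44140625. \]
Fig. 2.4 (top right) shows the graph of such sequence.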
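
Since a sequence is just a function of a natural number, its terms can be tabulated directly. The following Python sketch does so with exact rational arithmetic for $a_n = \frac{n}{n+1}$ and $a_n = (1 + \frac{1}{n})^n$; the function names `a` and `b` are illustrative.

```python
from fractions import Fraction

def a(n):
    """Sequence (2.3): n / (n + 1), defined for n >= 0."""
    return Fraction(n, n + 1)

def b(n):
    """Sequence (2.4): (1 + 1/n)^n, defined for n >= 1."""
    return (1 + Fraction(1, n)) ** n

print([str(a(n)) for n in range(4)])       # terms a_0, ..., a_3
print([str(b(n)) for n in range(1, 5)])    # terms a_1, ..., a_4
print(float(b(4)))
```

Using `Fraction` rather than floating-point numbers keeps the terms in the same fractional form as the text, so they can be compared digit by digit.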




Figure 2.4. Clockwise: graphs of the sequences (2.3), (2.4), (2.6), (2.5) 


xi) The sequence
\[ a_n = n! \tag{2.5} \]
associates to each natural number its factorial, defined in (1.9). The graph of
this sequence is shown in Fig. 2.4 (bottom left); as the values of the sequence
grow rapidly as $n$ increases, we used different scalings on the coordinate axes.


xii) The sequence
\[ a_n = (-1)^n = \begin{cases} +1 & \text{if } n \text{ is even}, \\ -1 & \text{if } n \text{ is odd} \end{cases} \qquad (n \ge 0) \tag{2.6} \]
has alternating values $+1$ and $-1$, according to the parity of $n$. The graph of the
sequence is shown in Fig. 2.4 (bottom right).


At last, here are two maps defined on $\mathbb{R}^2$ (functions of two real variables).


xiii) The function
\[ f : \mathbb{R}^2 \to \mathbb{R}, \qquad f(x, y) = \sqrt{x^2 + y^2} \]
maps a generic point $P$ of the plane with coordinates $(x, y)$ to its distance from
the origin.


xiv) The map
\[ f : \mathbb{R}^2 \to \mathbb{R}^2, \qquad f(x, y) = (y, x) \]
associates to a point $P$ the point $P'$ symmetric to $P$ with respect to the bisectrix
of the first and third quadrants.




Consider a map from $X$ to $Y$. One should take care in noting that the symbol
for an element of $X$ (to which one refers as the independent variable) and the
symbol for an element in $Y$ (dependent variable) are completely arbitrary. What
really determines the function is the way of associating each element of the domain
to its corresponding image. For example, if $x, y, z, t$ are symbols for real numbers,
the expressions $y = f(x) = 3x$, $x = f(y) = 3y$, or $z = f(t) = 3t$ denote the same
function, namely the one mapping each real number to its triple.


2.2 Range and pre-image 


Let $A$ be a subset of $X$. The image of $A$ under $f$ is the set
\[ f(A) = \{f(x) : x \in A\} \subseteq \operatorname{im} f \]
of all the images of elements of $A$. Notice that $f(A)$ is empty if and only if $A$
contains no elements of the domain of $f$. The image $f(X)$ of the whole set $X$ is
the range of $f$, already denoted by $\operatorname{im} f$.

Let $y$ be any element of $Y$; the pre-image of $y$ by $f$ is the set
\[ f^{-1}(y) = \{x \in \operatorname{dom} f : f(x) = y\} \]
of elements in $X$ whose image is $y$. This set is empty precisely when $y$ does not
belong to the range of $f$. If $B$ is a subset of $Y$, the pre-image of $B$ under $f$ is
defined as the set
\[ f^{-1}(B) = \{x \in \operatorname{dom} f : f(x) \in B\}, \]
union of all pre-images of elements of $B$.


It is easy to check that $A \subseteq f^{-1}(f(A))$ for any subset $A$ of $\operatorname{dom} f$, and
$f(f^{-1}(B)) = B \cap \operatorname{im} f \subseteq B$ for any subset $B$ of $Y$.


Example 2.2
Let $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^2$. The image under $f$ of the interval $A = [1, 2]$ is the
interval $B = [1, 4]$. Yet the pre-image of $B$ under $f$ is the union of the intervals
$[-2, -1]$ and $[1, 2]$, namely, the set
\[ f^{-1}(B) = \{x \in \mathbb{R} : 1 \le |x| \le 2\} \]
(see Fig. 2.5). □
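
The same phenomenon can be observed on a finite stand-in for the real line. In the Python sketch below, `image` and `preimage` are illustrative helper names, and the integers from $-3$ to $3$ play the role of the domain; this is an assumption made only to keep the sets finite.

```python
def image(f, A):
    """Set of images f(x) for x in A."""
    return {f(x) for x in A}

def preimage(f, B, domain):
    """Set of points of the domain whose image lies in B."""
    return {x for x in domain if f(x) in B}

def f(x):
    return x * x

domain = range(-3, 4)            # finite stand-in for the real line
A = {1, 2}
B = image(f, A)                  # {1, 4}
print(sorted(preimage(f, B, domain)))
```

Here $f^{-1}(f(A)) = \{-2, -1, 1, 2\}$ strictly contains $A$, in agreement with the inclusion $A \subseteq f^{-1}(f(A))$, while the image of the pre-image of $\{4, 5\}$ is only $\{4\}$, since $5$ is not a value of $f$ on this domain.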


The notions of infimum, supremum, maximum and minimum, introduced in 
Sect. 1.3.1, specialise in the case of images of functions. 


Figure 2.5. Image (left) and pre-image (right) of an interval relative to the function
$f(x) = x^2$


Definition 2.3 Let $f$ be a real map and $A$ a subset of $\operatorname{dom} f$. One calls
supremum of $f$ on $A$ (or in $A$) the supremum of the image of $A$ under $f$
\[ \sup_{x \in A} f(x) = \sup f(A) = \sup\{f(x) \mid x \in A\}. \]
Then $f$ is bounded from above on $A$ if the set $f(A)$ is bounded from above,
or equivalently, if $\sup_{x \in A} f(x) < +\infty$.

If $\sup_{x \in A} f(x)$ is finite and belongs to $f(A)$, then it is the maximum of this set.
This number is the maximum value (or simply, the maximum) of $f$ on
$A$ and is denoted by $\max_{x \in A} f(x)$.

The concepts of infimum and of minimum of $f$ on $A$ are defined similarly.
Finally, $f$ is said to be bounded on $A$ if the set $f(A)$ is bounded.


At times, the shorthand notations $\sup_A f$, $\max_A f$, etc. are used.


The maximum value $M = \max_A f$ of $f$ on the set $A$ is characterised by the
conditions:

i) $M$ is a value assumed by the function on $A$, i.e.,
there exists $x_M \in A$ such that $f(x_M) = M$;
ii) $M$ is greater than or equal to any other value of the map on $A$, so
for any $x \in A$, $f(x) \le M$.


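Example 2.4
Consider the function $f(x)$ defined in (2.2). One verifies easily
\[ \max_{x \in [0,3]} f(x) = 3, \quad \min_{x \in [0,3]} f(x) = 0, \quad \max_{x \in [1,3]} f(x) = 3, \quad \inf_{x \in [1,3]} f(x) = 1. \]
The map does not assume the value 1 anywhere in the interval $[1, 3]$, so there is
no minimum on that set.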




2.3 Surjective and injective functions; inverse function 


A map with values in $Y$ is called onto if $\operatorname{im} f = Y$. This means that each $y \in Y$
is the image of at least one element $x \in X$. The term surjective (on $Y$) has the
same meaning. For instance, $f : \mathbb{R} \to \mathbb{R}$, $f(x) = ax + b$ with $a \neq 0$ is surjective
on $\mathbb{R}$, or onto: the real number $y$ is the image of $x = \frac{y - b}{a}$. On the contrary, the
function $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^2$ is not onto, because its range coincides with the
interval $[0, +\infty)$.


A function $f$ is called one-to-one (or 1-1) if every $y \in \operatorname{im} f$ is the image of a
unique element $x \in \operatorname{dom} f$. Otherwise put, if $y = f(x_1) = f(x_2)$ for some elements
$x_1, x_2$ in the domain of $f$, then necessarily $x_1 = x_2$. This, in turn, is equivalent to
\[ x_1 \neq x_2 \ \Rightarrow\ f(x_1) \neq f(x_2) \]
for all $x_1, x_2 \in \operatorname{dom} f$ (see Fig. 2.6). Again, the term injective may be used. If a
map $f$ is one-to-one, we can associate to each element $y$ in the range the unique $x$
in the domain with $f(x) = y$. Such correspondence determines a function defined
on $Y$ and with values in $X$, called inverse function of $f$ and denoted by the
symbol $f^{-1}$. Thus
\[ x = f^{-1}(y) \iff y = f(x) \]
(the notation mixes up deliberately the pre-image of $y$ under $f$ with the unique
element this set contains). The inverse function $f^{-1}$ has the image of $f$ as its
domain, and the domain of $f$ as range:
\[ \operatorname{dom} f^{-1} = \operatorname{im} f, \qquad \operatorname{im} f^{-1} = \operatorname{dom} f. \]


Figure 2.6. Representation of a one-to-one function and its inverse 




A one-to-one map is therefore invertible; the two notions (injectivity and invert- 
ibility) coincide. 

What is the link between the graphs of $f$, defined in (2.1), and of the inverse
function $f^{-1}$? One has
\[ \Gamma(f^{-1}) = \{(y, f^{-1}(y)) \in Y \times X : y \in \operatorname{dom} f^{-1}\} = \{(f(x), x) \in Y \times X : x \in \operatorname{dom} f\}. \]
Therefore, the graph of the inverse map may be obtained from the graph of $f$ by
swapping the components in each pair. For real functions of one real variable, this
corresponds to a reflection in the Cartesian plane with respect to the bisectrix
$y = x$ (see Fig. 2.7: a) is reflected into b)). On the other hand, finding the explicit
expression $x = f^{-1}(y)$ of the inverse function could be hard, if possible at all.

Provided that the inverse map in the form $x = f^{-1}(y)$ can be determined, often
one prefers to denote the independent variable (of $f^{-1}$) by $x$, and the dependent
variable by $y$, thus obtaining the expression $y = f^{-1}(x)$. This is merely a change
of notation (see the remark at the end of Sect. 2.1). The procedure allows one to draw
the graph of the inverse function in the same frame system of $f$ (see Fig. 2.7, from
b) to c)).
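
The "swap the components" description translates directly into code when the graph is a finite set of pairs. The sketch below (helper names are illustrative, and the domain is restricted to a few integers as an assumption) builds the swapped graph and tests whether it is still the graph of a function, which happens exactly when the original map is one-to-one.

```python
def graph(f, domain):
    """Finite graph of f: the set of pairs (x, f(x))."""
    return {(x, f(x)) for x in domain}

def inverse_graph(pairs):
    """Swap the components of each pair."""
    return {(y, x) for (x, y) in pairs}

def is_function_graph(pairs):
    """True when no first component is paired with two different values."""
    firsts = [x for x, _ in pairs]
    return len(firsts) == len(set(firsts))

domain = range(-3, 4)
cube = graph(lambda x: x ** 3, domain)
square = graph(lambda x: x ** 2, domain)

print(is_function_graph(inverse_graph(cube)))     # True: x^3 is one-to-one
print(is_function_graph(inverse_graph(square)))   # False: (4, 2) and (4, -2)
```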




Figure 2.7. From the graph of a function to the graph of its inverse 




Examples 2.5 


i) The function $f : \mathbb{R} \to \mathbb{R}$, $f(x) = ax + b$ is one-to-one for all $a \neq 0$ (in fact,
$f(x_1) = f(x_2) \Rightarrow ax_1 = ax_2 \Rightarrow x_1 = x_2$). Its inverse is $x = f^{-1}(y) = \frac{y - b}{a}$, or
$y = f^{-1}(x) = \frac{x - b}{a}$.


ii) The map $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^2$ is not one-to-one because $f(x) = f(-x)$ for
any real $x$. Yet if we consider only values $\ge 0$ for the independent variable, i.e.,
if we restrict $f$ to the interval $[0, +\infty)$, then the function becomes 1-1 (in fact,
$f(x_1) = f(x_2) \Rightarrow x_1^2 - x_2^2 = (x_1 - x_2)(x_1 + x_2) = 0 \Rightarrow x_1 = x_2$). The inverse
function $x = f^{-1}(y) = \sqrt{y}$ is also defined on $[0, +\infty)$. Conventionally one says
that the ‘squaring’ map $y = x^2$ has the function ‘square root’ $y = \sqrt{x}$ for inverse
(on $[0, +\infty)$). Notice that the restriction of $f$ to the interval $(-\infty, 0]$ is 1-1, too;
the inverse in this case is $y = -\sqrt{x}$.


iii) The map $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^3$ is one-to-one. In fact $f(x_1) = f(x_2) \Rightarrow$
$x_1^3 - x_2^3 = (x_1 - x_2)(x_1^2 + x_1 x_2 + x_2^2) = 0 \Rightarrow x_1 = x_2$ since $x_1^2 + x_1 x_2 + x_2^2 =$
$\frac{1}{2}[x_1^2 + x_2^2 + (x_1 + x_2)^2] > 0$ for any $x_1 \neq x_2$. The inverse function is the ‘cube
root’ $y = \sqrt[3]{x}$, defined on all $\mathbb{R}$.


As in Example ii) above, if a function $f$ is not injective over the whole domain,
it might be so on a subset $A \subseteq \operatorname{dom} f$. The restriction of $f$ to $A$ is the function
\[ f_{|A} : A \to Y \quad \text{such that} \quad f_{|A}(x) = f(x), \quad \forall x \in A, \]
and is therefore invertible.


Let $f$ be defined on $X$ with values in $Y$. If $f$ is one-to-one and onto, it is called
a bijection (or bijective function) from $X$ to $Y$. If so, the inverse map $f^{-1}$ is
defined on $Y$, and is one-to-one and onto (on $X$); thus, $f^{-1}$ is a bijection from $Y$
to $X$.

For example, the functions $f(x) = ax + b$ ($a \neq 0$) and $f(x) = x^3$ are bijections
from $\mathbb{R}$ to itself. The function $f(x) = x^2$ is a bijection on $[0, +\infty)$ (i.e., from
$[0, +\infty)$ to $[0, +\infty)$).

If $f$ is a bijection between $X$ and $Y$, the sets $X$ and $Y$ are in bijective
correspondence through $f$: each element of $X$ is assigned to one and only one element
of $Y$, and vice versa. The reader should notice that two finite sets (i.e., containing
a finite number of elements) are in bijective correspondence if and only if they
have the same number of elements. On the contrary, an infinite set can correspond
bijectively to a proper subset; the function (sequence) $f : \mathbb{N} \to \mathbb{N}$, $f(n) = 2n$, for
example, establishes a bijection between $\mathbb{N}$ and the subset of even numbers.


To conclude the section, we would like to mention a significant interpretation 
of the notions of 1-1, onto, and bijective maps just introduced. Both in pure Math- 
ematics and in applications one is frequently interested in solving a problem, or 
an equation, of the form 


\[ f(x) = y, \]


where f is a suitable function between two sets X and Y. The quantity y represents 
the datum of the problem, while x stands for the solution to the problem, or the 
unknown of the equation. For instance, given the real number y, find the real 
number x solution of the algebraic equation 


\[ x^5 + x^3 - \sqrt[3]{x} = y. \]


Well, to say that $f$ is an onto function on $Y$ is the same as saying that the problem
or equation of concern admits at least one solution for each given $y$ in $Y$; asking $f$
to be 1-1 is equivalent to saying the solution, if it exists at all, is unique. Finally,
$f$ being a bijection from $X$ to $Y$ means that for any given $y$ in $Y$ there is one, and only
one, solution $x \in X$.


2.4 Monotone functions 


Let f be a real map of one real variable, and J the domain of f or an interval 
contained in the domain. We would like to describe precisely the situation in which 
the dependent variable increases or decreases as the independent variable grows. 
Examples are the increase in the pressure of a gas inside a sealed container as 
we raise its temperature, or the decrease of the level of fuel in the tank as a car 
proceeds on a highway. We have the following definition. 


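Definition 2.6 The function $f$ is increasing on $I$ if, given elements $x_1, x_2$
in $I$ with $x_1 < x_2$, one has $f(x_1) \le f(x_2)$; in symbols
\[ \forall x_1, x_2 \in I, \quad x_1 < x_2 \ \Rightarrow\ f(x_1) \le f(x_2). \tag{2.7} \]
The function $f$ is strictly increasing on $I$ if
\[ \forall x_1, x_2 \in I, \quad x_1 < x_2 \ \Rightarrow\ f(x_1) < f(x_2). \tag{2.8} \]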


Figure 2.8. Strictly increasing (left) and decreasing (right) functions on an interval I 




If a map is strictly increasing then it is increasing as well, hence condition (2.8) is 
stronger than (2.7). 

The definitions of decreasing and strictly decreasing functions on $I$ are
obtained from the previous definitions by reversing the inequality between $f(x_1)$
and $f(x_2)$.

The function $f$ is (strictly) monotone on $I$ if it is either (strictly) increasing
or (strictly) decreasing on $I$. An interval $I$ where $f$ is monotone is called an interval
of monotonicity of $f$.


Examples 2.7 


i) The map $f : \mathbb{R} \to \mathbb{R}$, $f(x) = ax + b$, is strictly increasing on $\mathbb{R}$ for $a > 0$,
constant on $\mathbb{R}$ for $a = 0$ (hence increasing as well as decreasing), and strictly
decreasing on $\mathbb{R}$ when $a < 0$.

ii) The map $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^2$ is strictly increasing on $I = [0, +\infty)$. Taking
in fact two arbitrary numbers $x_1, x_2 \ge 0$ with $x_1 < x_2$, we have $x_1^2 \le x_1 x_2 < x_2^2$.
Similarly, $f$ is strictly decreasing on $(-\infty, 0]$. It is not difficult to check that
all functions of the type $y = x^n$, with $n \ge 4$ even, have the same monotonic
behaviour as $f$ (Fig. 2.9, left).


iii) The function $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^3$ strictly increases on $\mathbb{R}$. All functions like
$y = x^n$ with $n$ odd have analogous behaviour (Fig. 2.9, right).


iv) Referring to Examples 2.1, the maps $y = [x]$ and $y = \operatorname{sign}(x)$ are increasing
(though not strictly increasing) on $\mathbb{R}$.

The mantissa $y = M(x)$ of $x$, instead, is not monotone on $\mathbb{R}$; but it is nevertheless
strictly increasing on each interval $[n, n + 1)$, $n \in \mathbb{Z}$.
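
Conditions (2.7) and (2.8) can be tested on a sorted list of sample points; by transitivity it is enough to compare consecutive values. The Python sketch below does this for the integer part, the sign and the mantissa. The helper names and the sample grid are illustrative assumptions, and a check on finitely many points can only refute monotonicity, never prove it.

```python
import math

def is_increasing(f, xs, strict=False):
    """Check (2.7), or (2.8) when strict=True, on the sorted samples xs."""
    values = [f(x) for x in xs]
    if strict:
        return all(u < v for u, v in zip(values, values[1:]))
    return all(u <= v for u, v in zip(values, values[1:]))

def sign(x):
    return (x > 0) - (x < 0)

def mantissa(x):
    return x - math.floor(x)

xs = [k / 4 for k in range(-12, 13)]      # -3, -2.75, ..., 3
unit = [k / 4 for k in range(0, 4)]       # samples in [0, 1)

print(is_increasing(math.floor, xs), is_increasing(math.floor, xs, strict=True))
print(is_increasing(mantissa, xs), is_increasing(mantissa, unit, strict=True))
```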


Figure 2.9. Graphs of some functions y = x” with n even (left) and n odd (right) 




Now to a simple yet crucial result. 


Proposition 2.8 If $f$ is strictly monotone on its domain, then $f$ is one-to-one.


Proof. To fix ideas, let us suppose $f$ is strictly increasing. Given $x_1, x_2 \in \operatorname{dom} f$
with $x_1 \neq x_2$, then either $x_1 < x_2$ or $x_2 < x_1$. In the former case, using
(2.8) we obtain $f(x_1) < f(x_2)$, hence $f(x_1) \neq f(x_2)$. In the latter case the
same conclusion holds by swapping the roles of $x_1$ and $x_2$. □


Under the assumption of the above proposition, the inverse function
$f^{-1}$ exists; one can comfortably check that $f^{-1}$ is also strictly monotone, and in the
same way as $f$ (both are strictly increasing or strictly decreasing). For instance,
the strictly increasing function $f : [0, +\infty) \to [0, +\infty)$, $f(x) = x^2$ has, as inverse,
the strictly increasing function $f^{-1} : [0, +\infty) \to [0, +\infty)$, $f^{-1}(x) = \sqrt{x}$.


The logic implication
\[ f \text{ is strictly monotone on its domain} \ \Rightarrow\ f \text{ is one-to-one} \]
cannot be reversed. In other words, a map $f$ may be one-to-one without being
strictly monotone on its domain. For instance $f : \mathbb{R} \to \mathbb{R}$ defined by
\[ f(x) = \begin{cases} \dfrac{1}{x} & \text{if } x \neq 0, \\ 0 & \text{if } x = 0, \end{cases} \]
is one-to-one, actually bijective on $\mathbb{R}$, but it is not strictly increasing, nor strictly
decreasing on $\mathbb{R}$. We shall return to this issue in Sect. 4.3.


A useful remark is the following. The sum of functions that are similarly monotone
(i.e., all increasing or all decreasing) is still a monotone function of the same
kind, and turns out to be strictly monotone if at least one of the summands is.
The map $f(x) = x^5 + x$, for instance, is strictly increasing on $\mathbb{R}$, being the sum
of two functions with the same property. According to Proposition 2.8 $f$ is then
invertible; note however that the relation $f(x) = y$ cannot be made explicit in the
form $x = f^{-1}(y)$.
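
Even without an explicit formula, the values of $f^{-1}$ can be approximated, precisely because $f$ is strictly increasing. The following Python sketch uses bisection, a technique not discussed in the text and chosen here only for illustration; the bracket-doubling step and the iteration count are arbitrary assumptions.

```python
def f(x):
    return x ** 5 + x

def f_inverse(y, steps=100):
    """Approximate the unique x with f(x) = y by bisection."""
    lo, hi = -1.0, 1.0
    while f(lo) > y:          # widen the bracket until f(lo) <= y
        lo *= 2
    while f(hi) < y:          # widen the bracket until f(hi) >= y
        hi *= 2
    for _ in range(steps):    # halve the bracket, keeping f(lo) <= y <= f(hi)
        mid = (lo + hi) / 2
        if f(mid) < y:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

print(f_inverse(2), f_inverse(34))
```

Since $f(1) = 2$ and $f(2) = 34$, the two printed values approximate 1 and 2. Strict monotonicity is what guarantees that the bracket always contains exactly one solution.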


2.5 Composition of functions 


Let $X, Y, Z$ be sets. Suppose $f$ is a function from $X$ to $Y$, and $g$ a function from
$Y$ to $Z$. We can manufacture a new function $h$ from $X$ to $Z$ by setting
\[ h(x) = g(f(x)). \tag{2.9} \]
The function $h$ is called composition of $f$ and $g$, sometimes composite map,
and is indicated by the symbol $h = g \circ f$ (read ‘$g$ composed (with) $f$’).




Example 2.9 


Consider the two real maps $y = f(x) = x - 3$ and $z = g(y) = y^2 + 1$ of one real
variable. The composition of $f$ and $g$ reads $z = h(x) = g \circ f(x) = (x - 3)^2 + 1$. □


Bearing in mind definition (2.9), the domain of the composition $g \circ f$ is
determined as follows: in order for $x$ to belong to the domain of $g \circ f$, $f(x)$ must be
defined, so $x$ must be in the domain of $f$; moreover, $f(x)$ has to be an element of
the domain of $g$. Thus
\[ x \in \operatorname{dom} g \circ f \iff x \in \operatorname{dom} f \ \text{ and } \ f(x) \in \operatorname{dom} g. \]
The domain of $g \circ f$ is then a subset of the domain of $f$ (see Fig. 2.10).


Examples 2.10

i) The domain of $f(x) = \dfrac{x + 2}{|x - 1|}$ is $\mathbb{R} \setminus \{1\}$, while $g(y) = \sqrt{y}$ is defined on the
interval $[0, +\infty)$. The domain of $g \circ f(x) = \sqrt{\dfrac{x + 2}{|x - 1|}}$ consists of the $x \neq 1$ such
that $\dfrac{x + 2}{|x - 1|} \ge 0$; hence, $\operatorname{dom} g \circ f = [-2, +\infty) \setminus \{1\}$.

ii) Sometimes the composition $g \circ f$ has an empty domain. This happens for
instance for $f(x) = \dfrac{1}{1 + x^2}$ (notice $f(x) \le 1$) and $g(y) = \sqrt{y - 5}$ (whose domain
is $[5, +\infty)$). □
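
The rule "$x \in \operatorname{dom} f$ and $f(x) \in \operatorname{dom} g$" can be mimicked in code by letting each function reject arguments outside its domain. The Python sketch below does so for the two pairs of maps $f(x) = \frac{x+2}{|x-1|}$, $g(y) = \sqrt{y}$ and $f(x) = \frac{1}{1+x^2}$, $g(y) = \sqrt{y-5}$; the names `f2`, `g2`, `in_domain` and the sample grid are illustrative assumptions.

```python
import math

def f(x):
    """f(x) = (x + 2)/|x - 1|, undefined at x = 1."""
    if x == 1:
        raise ValueError("outside dom f")
    return (x + 2) / abs(x - 1)

def g(y):
    """g(y) = sqrt(y), defined for y >= 0."""
    if y < 0:
        raise ValueError("outside dom g")
    return math.sqrt(y)

def f2(x):
    """f2(x) = 1/(1 + x^2), defined everywhere, with values in (0, 1]."""
    return 1 / (1 + x * x)

def g2(y):
    """g2(y) = sqrt(y - 5), defined for y >= 5."""
    if y < 5:
        raise ValueError("outside dom g2")
    return math.sqrt(y - 5)

def in_domain(outer, inner, x):
    """True when x is in dom(inner) and inner(x) is in dom(outer)."""
    try:
        outer(inner(x))
        return True
    except ValueError:
        return False

samples = [k / 2 for k in range(-10, 11)]               # -5, -4.5, ..., 5
print([x for x in samples if in_domain(g, f, x)])       # x >= -2, x != 1
print([x for x in samples if in_domain(g2, f2, x)])     # empty list
```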


Figure 2.10. Representation of a composite function via Venn diagrams. 




The operation of composition is not commutative: if $g \circ f$ and $f \circ g$ are both
defined (for instance, when $X = Y = Z$), the two composites do not coincide in
general. Take for example $f(x) = \dfrac{1}{x}$ and $g(x) = \dfrac{1}{1 + x}$, for which $g \circ f(x) = \dfrac{x}{1 + x}$
but $f \circ g(x) = 1 + x$.
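
A quick numerical check makes the failure of commutativity concrete. The Python sketch below evaluates both composites of $f(x) = \frac{1}{x}$ and $g(x) = \frac{1}{1+x}$ at the sample point $x = 2$ (an arbitrary choice) using exact fractions.

```python
from fractions import Fraction

def f(x):
    return 1 / x

def g(x):
    return 1 / (1 + x)

x = Fraction(2)
g_after_f = g(f(x))    # x / (1 + x) at x = 2
f_after_g = f(g(x))    # 1 + x at x = 2
print(g_after_f, f_after_g)
```

The two values, $\frac{2}{3}$ and $3$, agree with the closed forms $\frac{x}{1+x}$ and $1 + x$ and are clearly different.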


If $f$ and $g$ are both one-to-one (or both onto, or both bijective), it is not difficult
to verify that $g \circ f$ has the same property. In the first case in particular, the formula
\[ (g \circ f)^{-1} = f^{-1} \circ g^{-1} \]
holds.


Moreover, if $f$ and $g$ are real monotone functions of real variable, $g \circ f$ too will
be monotone, or better: $g \circ f$ is increasing if both $f$ and $g$ are either increasing
or decreasing, and decreasing otherwise. Let us prove only one of these properties.
Let for example $f$ increase and $g$ decrease; if $x_1 < x_2$ are elements in $\operatorname{dom} g \circ f$,
the monotone behaviour of $f$ implies $f(x_1) \le f(x_2)$; now the monotonicity of $g$
yields $g(f(x_1)) \ge g(f(x_2))$, so $g \circ f$ is decreasing.


We observe finally that if $f$ is a one-to-one function (and as such it admits
inverse $f^{-1}$), then
\[ f^{-1} \circ f(x) = f^{-1}(f(x)) = x, \quad \forall x \in \operatorname{dom} f, \]
\[ f \circ f^{-1}(y) = f(f^{-1}(y)) = y, \quad \forall y \in \operatorname{im} f. \]
Calling identity map on a set $X$ the function $\operatorname{id}_X : X \to X$ such that $\operatorname{id}_X(x) = x$
for all $x \in X$, we have $f^{-1} \circ f = \operatorname{id}_{\operatorname{dom} f}$ and $f \circ f^{-1} = \operatorname{id}_{\operatorname{im} f}$.


2.5.1 Translations, rescalings, reflections 


Let $f$ be a real map of one real variable (for instance, the function of Fig. 2.11).
Fix a real number $c \neq 0$, and denote by $t_c : \mathbb{R} \to \mathbb{R}$ the function $t_c(x) = x + c$.
Composing $f$ with $t_c$ results in a translation of the graph of $f$: precisely, the
graph of the function $f \circ t_c(x) = f(x + c)$ is shifted horizontally with respect to the
graph of $f$: towards the left if $c > 0$, to the right if $c < 0$. Similarly, the graph of
$t_c \circ f(x) = f(x) + c$ is translated vertically with respect to the graph of $f$, towards
the top for $c > 0$, towards the bottom if $c < 0$. Fig. 2.12 provides examples of these
situations.


Figure 2.11. Graph of a function $f(x)$


Fix a real number c > 0 and denote by s, : R > R the map s,(x) = cx. The 
composition of f with s, has the effect of rescaling the graph of f. Precisely, 
if c > 1 the graph of the function f o s.(x) = f(cx) is ‘compressed’ horizontally 
towards the y-axis, with respect to the graph of f; if 0 < c < 1 instead, the 
graph ‘stretches’ away from the y-axis. The analogue effect, though in the vertical 
direction, is seen for the function s, 0 f(x) = cf(x): here c > 1 ‘spreads out’ the 
graph away from the x-axis, while 0 < c < 1 ‘squeezes’ it towards the axis, see 
Fig. 2.13. 


Notice also that the graph of $f(-x)$ is obtained by reflecting the graph of $f(x)$
about the y-axis, like in front of a mirror. The graph of $f(|x|)$ instead coincides
with that of $f$ for $x \geq 0$, and for $x < 0$ it is the mirror image of the latter with
respect to the vertical axis. At last, the graph of $|f(x)|$ is the same as the graph of
$f$ where $f(x) \geq 0$, and is given by reflecting the latter about the x-axis where $f(x) < 0$, see Fig. 2.14.
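
The translations, rescalings and reflections described in this section can be tried out numerically. The following is a minimal Python sketch, not part of the original text: the helper names and the function `sample` are hypothetical and chosen only for illustration.

```python
# Each helper takes a function f and returns the transformed function.

def translate_arg(f, c):
    """x -> f(x + c): graph shifted left if c > 0, right if c < 0."""
    return lambda x: f(x + c)

def translate_val(f, c):
    """x -> f(x) + c: graph shifted up if c > 0, down if c < 0."""
    return lambda x: f(x) + c

def rescale_arg(f, c):
    """x -> f(c x), c > 0: horizontal compression (c > 1) or stretch (0 < c < 1)."""
    return lambda x: f(c * x)

def rescale_val(f, c):
    """x -> c f(x), c > 0: vertical stretch (c > 1) or squeeze (0 < c < 1)."""
    return lambda x: c * f(x)

def reflect_arg(f):
    """x -> f(-x): reflection about the y-axis."""
    return lambda x: f(-x)

def even_extension(f):
    """x -> f(|x|): keeps the graph for x >= 0 and mirrors it for x < 0."""
    return lambda x: f(abs(x))

def abs_value(f):
    """x -> |f(x)|: the part of the graph below the x-axis is flipped upwards."""
    return lambda x: abs(f(x))

def sample(x):
    # hypothetical sample function, used only for illustration
    return x * x - 2 * x

shifted = translate_arg(sample, 3)
# The value that sample takes at 1 is taken by the shifted map at 1 - 3 = -2.
print(shifted(-2) == sample(1))
```

The shift to the left for $c > 0$ shows up in the last line: the shifted map reaches a given value three units earlier than the original one.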


Figure 2.12. Graphs of the functions $f(x + c)$ ($c > 0$: top left, $c < 0$: top right), and
$f(x) + c$ ($c < 0$: bottom left, $c > 0$: bottom right), where $f(x)$ is the map of Fig. 2.11


Figure 2.13. Graph of $f(cx)$ with $c > 1$ (top left), $0 < c < 1$ (top right), and of $cf(x)$
with $c > 1$ (bottom left), $0 < c < 1$ (bottom right)


2.6 Elementary functions and properties 


We start with a few useful definitions. 


Definition 2.11 Let $f : \mathrm{dom}\, f \subseteq \mathbb{R} \to \mathbb{R}$ be a map with a symmetric domain
with respect to the origin, hence such that $x \in \mathrm{dom}\, f$ forces $-x \in \mathrm{dom}\, f$ as
well. The function $f$ is said to be even if $f(-x) = f(x)$ for all $x \in \mathrm{dom}\, f$, odd if
$f(-x) = -f(x)$ for all $x \in \mathrm{dom}\, f$.


The graph of an even function is symmetric with respect to the y-axis, and that
of an odd map is symmetric with respect to the origin. If $f$ is odd and defined at the
origin, necessarily it must vanish there, for $f(0) = -f(0)$ forces $f(0) = 0$.
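
As an illustration of Definition 2.11, the sketch below tests the two symmetry conditions on a finite set of sample points. It is only a numerical plausibility check under the assumption that the chosen points are representative, not a proof; the function name `parity` and the tolerance are hypothetical choices.

```python
import math

def parity(f, points, tol=1e-12):
    """Classify f on the given sample points as 'even', 'odd', 'both' or 'neither'."""
    is_even = all(abs(f(-x) - f(x)) <= tol for x in points)
    is_odd = all(abs(f(-x) + f(x)) <= tol for x in points)
    if is_even and is_odd:
        return "both"      # happens only if f vanishes on all sample points
    if is_even:
        return "even"
    if is_odd:
        return "odd"
    return "neither"

# sample points 0.1, 0.2, ..., 3.0 (their opposites are tested automatically)
points = [0.1 * k for k in range(1, 31)]

print(parity(math.cos, points))
print(parity(math.sin, points))
print(parity(math.exp, points))
```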


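Definition 2.12 A function $f : \mathrm{dom}\, f \subseteq \mathbb{R} \to \mathbb{R}$ is said to be periodic of period
$p$ (with $p > 0$ real) if $\mathrm{dom}\, f$ is invariant under translations by $\pm p$ (i.e., if
$x \pm p \in \mathrm{dom}\, f$ for all $x \in \mathrm{dom}\, f$) and if $f(x + p) = f(x)$ holds for any
$x \in \mathrm{dom}\, f$.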


Figure 2.14. Clockwise: graph of the functions $f(-x)$, $f(|x|)$, $|f(|x|)|$, $|f(x)|$


One easily sees that an $f$ periodic of period $p$ is also periodic of any multiple
$mp$ ($m \in \mathbb{N} \setminus \{0\}$) of $p$. If the smallest period exists, it goes under the name
of minimum period of the function. A constant map is clearly periodic of any
period $p > 0$ and thus has no minimum period.


Let us review now the main elementary functions. 


2.6.1 Powers 


These are functions of the form $y = x^\alpha$. The case $\alpha = 0$ is trivial, giving rise to the
constant function $y = x^0 = 1$. Suppose then $\alpha > 0$. For $\alpha = n \in \mathbb{N} \setminus \{0\}$, we find
the monomial functions $y = x^n$ defined on $\mathbb{R}$, already considered in Example 2.7 ii)
and iii). When $n$ is odd, the maps are odd, strictly increasing on $\mathbb{R}$ and with range
$\mathbb{R}$ (recall Property 1.8). When $n$ is even, the functions are even, strictly decreasing
on $(-\infty, 0]$ and strictly increasing on $[0, +\infty)$; their range is the interval $[0, +\infty)$.

Consider now the case $\alpha > 0$ rational. If $\alpha = \frac{1}{m}$ where $m \in \mathbb{N} \setminus \{0\}$, we define a
function, called $m$th root of $x$ and denoted $y = x^{1/m} = \sqrt[m]{x}$, inverting $y = x^m$. It
has domain $\mathbb{R}$ if $m$ is odd, $[0, +\infty)$ if $m$ is even. The $m$th root is strictly increasing
and ranges over $\mathbb{R}$ or $[0, +\infty)$, according to whether $m$ is odd or even respectively.

In general, for $\alpha = \frac{n}{m} \in \mathbb{Q}$, $n, m \in \mathbb{N} \setminus \{0\}$ with no common divisors, the
function $y = x^{n/m}$ is defined as $y = (x^n)^{1/m} = \sqrt[m]{x^n}$. As such, it has domain $\mathbb{R}$
if $m$ is odd, $[0, +\infty)$ if $m$ is even. It is strictly increasing on $[0, +\infty)$ for any $n$,
$m$, while if $m$ is odd it strictly increases or decreases on $(-\infty, 0]$ according to the
parity of $n$ (it increases for $n$ odd, decreases for $n$ even).


Figure 2.15. Graphs of the functions $y = x^{5/3}$ (left), $y = x^{4/3}$ (middle) and $y = x^{3/2}$
(right)


Let us consider some examples (Fig. 2.15). The map $y = x^{5/3}$, defined on $\mathbb{R}$,
is strictly increasing and has range $\mathbb{R}$. The map $y = x^{4/3}$ is defined on $\mathbb{R}$, strictly
decreases on $(-\infty, 0]$ and strictly increases on $[0, +\infty)$, which is also its range. To
conclude, $y = x^{3/2}$ is defined only on $[0, +\infty)$, where it is strictly increasing and
has $[0, +\infty)$ as range.

Let us introduce now the generic function $y = x^\alpha$ with irrational $\alpha > 0$. To this
end, note that if $a$ is a non-negative real number we can define the power $a^\alpha$ with
$\alpha \in \mathbb{R}_+ \setminus \mathbb{Q}$, starting from powers with rational exponent and exploiting the density
of rationals inside $\mathbb{R}$. If $a \geq 1$, we can in fact define $a^\alpha = \sup\{a^{n/m} \mid \frac{n}{m} \leq \alpha\}$,
while for $0 \leq a < 1$ we set $a^\alpha = \inf\{a^{n/m} \mid \frac{n}{m} \leq \alpha\}$. Thus the map $y = x^\alpha$ with
$\alpha \in \mathbb{R}_+ \setminus \mathbb{Q}$ is defined on $[0, +\infty)$, and one proves it is there strictly increasing and
its range is $[0, +\infty)$.
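
The definition of $a^\alpha$ as a supremum over rational exponents can be explored numerically. The sketch below is an illustration only, for the case $a \geq 1$: it scans the rationals $n/m \leq \alpha$ with denominator up to a bound `max_den` (a hypothetical parameter) and keeps the largest power found. It relies on Python's floating-point power for the rational exponents, so it illustrates the convergence rather than providing an independent construction.

```python
import math

def power_via_rationals(a, alpha, max_den=2000):
    """Largest a**(n/m) over rationals n/m <= alpha with 1 <= m <= max_den (a >= 1)."""
    if a < 1:
        raise ValueError("this sketch covers only the case a >= 1")
    best = 0.0
    for m in range(1, max_den + 1):
        n = math.floor(alpha * m)      # largest numerator with n/m <= alpha
        best = max(best, a ** (n / m))
    return best

alpha = math.sqrt(2)                   # an irrational exponent
approx = power_via_rationals(2.0, alpha)
exact = 2.0 ** alpha                   # reference value from the built-in power
print(approx, exact)
```

Raising `max_den` enlarges the set of admissible rationals, so the returned value can only grow towards the supremum.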

Summarising, we have defined $y = x^\alpha$ for every value $\alpha > 0$. They are all
defined at least on $[0, +\infty)$, interval on which they are strictly increasing; moreover,
they satisfy $y(0) = 0$, $y(1) = 1$. It will turn out useful to remark that if $\alpha < \beta$,

$0 < x^\beta < x^\alpha < 1$ for $0 < x < 1$, \qquad $1 < x^\alpha < x^\beta$ for $x > 1$    (2.10)

(see Fig. 2.16).


Figure 2.16. Graphs of $y = x^\alpha$, $x \geq 0$, for some $\alpha > 0$


Figure 2.17. Graphs of $y = x^\alpha$ for two values $\alpha < 0$


At last, consider the case of $\alpha < 0$. Set $y = x^\alpha = \frac{1}{x^{-\alpha}}$ by definition. Its
domain coincides with the domain of $y = x^{-\alpha}$ minus the origin. All maps are
strictly decreasing on $(0, +\infty)$, while on $(-\infty, 0)$ the behaviour is as follows: writing
$\alpha = -\frac{n}{m}$ with $m$ odd, the map is strictly increasing if $n$ is even, strictly decreasing
if $n$ is odd, as shown in Fig. 2.17. In conclusion, we observe that for every $\alpha \neq 0$,
the inverse function of $y = x^\alpha$, where defined, is $y = x^{1/\alpha}$.


2.6.2 Polynomial and rational functions 


A polynomial function, or simply, a polynomial, is a map of the form $P(x) =
a_n x^n + \cdots + a_1 x + a_0$ with $a_n \neq 0$; $n$ is the degree of the polynomial. Such a map
is defined over all $\mathbb{R}$; it is even (resp. odd) if and only if all coefficients indexed by
odd (resp. even) subscripts vanish (recall that 0 is an even number).


A rational function is of the kind $R(x) = \frac{P(x)}{Q(x)}$, where $P$ and $Q$ are poly-
nomials. If these have no common factor, the domain of the rational function will
be $\mathbb{R}$ without the zeroes of the denominator.


2.6.3 Exponential and logarithmic functions 


Let $a$ be a positive real number. According to what we have discussed previously,
the exponential function $y = a^x$ is defined for any real number $x$; it satisfies
$y(0) = a^0 = 1$.

If $a > 1$, the exponential is strictly increasing; if $a = 1$, this is the constant
map 1, while if $a < 1$, the function is strictly decreasing. When $a \neq 1$, the range
is $(0, +\infty)$ (Fig. 2.18). Recalling a few properties of powers is useful at this point:
for any $x, y \in \mathbb{R}$

$a^{x+y} = a^x a^y, \qquad a^{x-y} = \frac{a^x}{a^y}, \qquad (a^x)^y = a^{xy}.$


Figure 2.18. Graphs of the exponential functions $y = 2^x$ (left) and $y = (\frac{1}{2})^x$ (right)


When $a \neq 1$, the exponential function is strictly monotone on $\mathbb{R}$, hence invertible.
The inverse $y = \log_a x$ is called logarithm, is defined on $(0, +\infty)$ and ranges over
$\mathbb{R}$; it satisfies $y(1) = \log_a 1 = 0$. The logarithm is strictly increasing if $a > 1$,
strictly decreasing if $a < 1$ (Fig. 2.19). The previous properties translate into the
following:

$\log_a(xy) = \log_a x + \log_a y, \quad \forall x, y > 0,$

$\log_a \frac{x}{y} = \log_a x - \log_a y, \quad \forall x, y > 0,$

$\log_a(x^y) = y \log_a x, \quad \forall x > 0, \ \forall y \in \mathbb{R}.$


Figure 2.19. Graphs of $y = \log_2 x$ (left) and $y = \log_{1/2} x$ (right)


2.6.4 Trigonometric functions and inverses 


Denote here by $X, Y$ the coordinates on the Cartesian plane $\mathbb{R}^2$, and consider the
unit circle, i.e., the circle of unit radius centred at the origin $O = (0, 0)$, whose
equation reads $X^2 + Y^2 = 1$. Starting from the point $A = (1, 0)$, intersection
of the circle with the positive x-axis, we go around the circle. More precisely,
given any real $x$ we denote by $P(x)$ the point on the circle reached by turning
counter-clockwise along an arc of length $x$ if $x \geq 0$, or clockwise by an arc of
length $-x$ if $x < 0$. The point $P(x)$ determines an angle in the plane with vertex
$O$ and delimited by the outbound rays from $O$ through the points $A$ and $P(x)$
respectively (Fig. 2.20). The number $x$ represents the measure of the angle in
radians. The one-radian angle is determined by an arc of length 1. This angle
measures $\frac{360}{2\pi} = 57.2957795\cdots$ degrees. Table 2.1 provides the correspondence
between degrees and radians for important angles. Henceforth all angles shall be
expressed in radians without further mention.
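
The conversion between degrees and radians follows directly from the fact that the full circle measures $2\pi$ radians or 360 degrees. A minimal sketch (the function names are hypothetical):

```python
import math

# One radian expressed in degrees: 360 / (2 pi)
ONE_RADIAN_IN_DEGREES = 360 / (2 * math.pi)

def to_radians(degrees):
    """Convert an angle from degrees to radians."""
    return degrees * math.pi / 180

def to_degrees(radians):
    """Convert an angle from radians to degrees."""
    return radians * 180 / math.pi

print(ONE_RADIAN_IN_DEGREES)
for deg in (30, 45, 60, 90, 180, 360):
    # ratio to pi, e.g. 30 degrees corresponds to pi/6
    print(deg, to_radians(deg) / math.pi)
```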


degrees | 0 | 30 | 45 | 60 | 90 | 120 | 135 | 150 | 180 | 270 | 360
radians | 0 | $\pi/6$ | $\pi/4$ | $\pi/3$ | $\pi/2$ | $2\pi/3$ | $3\pi/4$ | $5\pi/6$ | $\pi$ | $3\pi/2$ | $2\pi$


Table 2.1. Degrees versus radians


Increasing or decreasing by $2\pi$ the length $x$ has the effect of going around the
circle once, counter-clockwise or clockwise respectively, and returning to the initial
point $P(x)$. In other words, there is a periodicity

$P(x \pm 2\pi) = P(x), \quad \forall x \in \mathbb{R}.$    (2.11)


Denote by $\cos x$ ('cosine of $x$') and $\sin x$ ('sine of $x$') the X- and Y-coordinates,
respectively, of the point $P(x)$. Thus $P(x) = (\cos x, \sin x)$. Hence the cosine func-
tion $y = \cos x$ and the sine function $y = \sin x$ are defined on $\mathbb{R}$ and assume all
values of the interval $[-1, 1]$; by (2.11), they are periodic maps of minimum period
$2\pi$. They satisfy the crucial trigonometric relation

$\cos^2 x + \sin^2 x = 1, \quad \forall x \in \mathbb{R}.$    (2.12)


Figure 2.20. The unit circle


Figure 2.21. Graph of the map $y = \sin x$


It is rather evident from the geometric interpretation that the sine function 
is odd, while the cosine function is even. Their graphs are represented in Figures 
2.21 and 2.22. 

Important values of these maps are listed in the following table (where k is any 
integer): 


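$\sin x = 0$ for $x = k\pi$, \qquad $\cos x = 0$ for $x = \frac{\pi}{2} + k\pi$,
$\sin x = 1$ for $x = \frac{\pi}{2} + 2k\pi$, \qquad $\cos x = 1$ for $x = 2k\pi$,
$\sin x = -1$ for $x = -\frac{\pi}{2} + 2k\pi$, \qquad $\cos x = -1$ for $x = \pi + 2k\pi$.


Figure 2.22. Graph of the map $y = \cos x$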


Concerning monotonicity, one has

$y = \sin x$ is
strictly increasing on $[-\frac{\pi}{2} + 2k\pi, \frac{\pi}{2} + 2k\pi]$,
strictly decreasing on $[\frac{\pi}{2} + 2k\pi, \frac{3\pi}{2} + 2k\pi]$;

$y = \cos x$ is
strictly decreasing on $[2k\pi, \pi + 2k\pi]$,
strictly increasing on $[\pi + 2k\pi, 2\pi + 2k\pi]$.


The addition and subtraction formulas are relevant

$\sin(\alpha \pm \beta) = \sin\alpha \cos\beta \pm \cos\alpha \sin\beta$

$\cos(\alpha \pm \beta) = \cos\alpha \cos\beta \mp \sin\alpha \sin\beta.$

Suitable choices of the arguments allow to infer from these the duplication formulas

$\sin 2x = 2 \sin x \cos x, \qquad \cos 2x = 2\cos^2 x - 1$    (2.13)

rather than

$\sin x - \sin y = 2 \sin\frac{x - y}{2} \cos\frac{x + y}{2}$    (2.14)

$\cos x - \cos y = -2 \sin\frac{x - y}{2} \sin\frac{x + y}{2}$    (2.15)

or the following

$\sin(x + \pi) = -\sin x, \qquad \cos(x + \pi) = -\cos x,$    (2.16)

$\sin(x + \frac{\pi}{2}) = \cos x, \qquad \cos(x + \frac{\pi}{2}) = -\sin x.$    (2.17)


In the light of Sect. 2.5.1, the first of (2.17) tells that the graph of the cosine is
obtained by left-translating the sine's graph by $\pi/2$ (compare Figures 2.21 and
2.22).
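
The identities of this subsection lend themselves to a quick numerical sanity check. The sketch below evaluates both sides of the duplication, difference and translation formulas at a few sample points, assuming the standard form of these identities; the helper names are hypothetical and the check is a plausibility test up to rounding, not a proof.

```python
import math

def close(u, v):
    """Equality up to floating-point rounding."""
    return math.isclose(u, v, rel_tol=1e-9, abs_tol=1e-12)

def check_identities(x, y):
    """True when all listed identities hold numerically at the points x, y."""
    return all([
        close(math.sin(2 * x), 2 * math.sin(x) * math.cos(x)),
        close(math.cos(2 * x), 2 * math.cos(x) ** 2 - 1),
        close(math.sin(x) - math.sin(y),
              2 * math.sin((x - y) / 2) * math.cos((x + y) / 2)),
        close(math.cos(x) - math.cos(y),
              -2 * math.sin((x - y) / 2) * math.sin((x + y) / 2)),
        close(math.sin(x + math.pi), -math.sin(x)),
        close(math.cos(x + math.pi), -math.cos(x)),
        close(math.sin(x + math.pi / 2), math.cos(x)),
        close(math.cos(x + math.pi / 2), -math.sin(x)),
        close(math.cos(x) ** 2 + math.sin(x) ** 2, 1.0),
    ])

samples = [-3.0, -1.2, 0.0, 0.7, 2.5]
print(all(check_identities(x, y) for x in samples for y in samples))
```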


The tangent function $y = \tan x$ (sometimes $y = \mathrm{tg}\, x$) and the cotangent
function $y = \mathrm{cotan}\, x$ (also $y = \mathrm{ctg}\, x$) are defined by

$\tan x = \frac{\sin x}{\cos x}, \qquad \mathrm{cotan}\, x = \frac{\cos x}{\sin x}.$


Because of (2.16), these maps are periodic of minimum period $\pi$, and not $2\pi$. The
tangent function is defined on $\mathbb{R} \setminus \{\frac{\pi}{2} + k\pi : k \in \mathbb{Z}\}$, it is strictly increasing on the
intervals $(-\frac{\pi}{2} + k\pi, \frac{\pi}{2} + k\pi)$ where it assumes every real number as value. Similarly,
the cotangent function is defined on $\mathbb{R} \setminus \{k\pi : k \in \mathbb{Z}\}$, is strictly decreasing on
the intervals $(k\pi, \pi + k\pi)$, on which it assumes every real value. Both maps are
odd. Their respective graphs are found in Fig. 2.23.


Figure 2.23. Graphs of the functions $y = \tan x$ (left) and $y = \mathrm{cotan}\, x$ (right)


Recall that $\tan x$ expresses geometrically the Y-coordinate of the intersection
point $Q(x)$ between the ray from the origin through $P(x)$ and the vertical line
containing $A$ (Fig. 2.20).


The trigonometric functions, being periodic, cannot be invertible on their whole 
domains. In order to invert them, one has to restrict to a maximal interval of strict 
monotonicity; in each case one such interval is chosen. 

The map $y = \sin x$ is strictly increasing on $[-\frac{\pi}{2}, \frac{\pi}{2}]$. The inverse function on
this particular interval is called inverse sine or arcsine and denoted $y = \arcsin x$
or $y = \mathrm{asin}\, x$; it is defined on $[-1, 1]$, everywhere strictly increasing and ranging
over the interval $[-\frac{\pi}{2}, \frac{\pi}{2}]$. This function is odd (Fig. 2.24, left).

Similarly, the function $y = \cos x$ is strictly decreasing on the interval $[0, \pi]$. By
restricting it to this interval one can define the inverse cosine, or arccosine,
$y = \arccos x$ or $y = \mathrm{acos}\, x$ on $[-1, 1]$, which is everywhere strictly decreasing and
has $[0, \pi]$ for range (Fig. 2.24, right).


Figure 2.24. Graphs of $y = \arcsin x$ (left) and $y = \arccos x$ (right)


The function $y = \tan x$ is strictly increasing on $(-\frac{\pi}{2}, \frac{\pi}{2})$. There, the inverse
is called inverse tangent, or arctangent, and denoted $y = \arctan x$ or $y =
\mathrm{atan}\, x$ (also $\mathrm{arctg}\, x$). It is strictly increasing on its entire domain $\mathbb{R}$, and has range
$(-\frac{\pi}{2}, \frac{\pi}{2})$. Also this is an odd map (Fig. 2.25, left).

In the analogous way the inverse cotangent, or arccotangent, $y = \mathrm{arccotan}\, x$
is the inverse of the cotangent on $(0, \pi)$ (Fig. 2.25, right).
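
The role of the restriction intervals can be seen with the inverse functions of Python's standard `math` module, whose `asin`, `acos` and `atan` return values in $[-\pi/2, \pi/2]$, $[0, \pi]$ and $(-\pi/2, \pi/2)$ respectively. The module has no inverse cotangent, so the sketch below defines a hypothetical helper `arccotan` through the relation $\mathrm{arccotan}\, x = \frac{\pi}{2} - \arctan x$, which takes values in $(0, \pi)$.

```python
import math

def arccotan(x):
    """Inverse of the cotangent restricted to (0, pi), via pi/2 - arctan x."""
    return math.pi / 2 - math.atan(x)

# Inside the restriction interval the inverse undoes the function ...
inside = 0.3
print(math.asin(math.sin(inside)))      # close to 0.3

# ... outside it, one lands on the point of [-pi/2, pi/2] with the same sine.
outside = 2.0
print(math.asin(math.sin(outside)))     # close to pi - 2, not 2

# arccotan inverts the cotangent on (0, pi)
t = 2.5
cotangent = math.cos(t) / math.sin(t)
print(arccotan(cotangent))              # close to 2.5
```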


Figure 2.25. Graphs of $y = \arctan x$ (left) and $y = \mathrm{arccotan}\, x$ (right)


2.7 Exercises 


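1. Determine the domains of the following functions:

a) $f(x) = \frac{3x + 1}{x^2 + x - 6}$
b) $f(x) = \frac{\sqrt{x^2 - 3x - 4}}{x + 5}$
c) $f(x) = \log(x^2 - x)$
d) $f(x) = \frac{1}{2x + 1}$ if $x \geq 0$, $f(x) = e^{\sqrt{x+1}}$ if $x < 0$

2. Determine the range of the following functions:

a) $f(x) = \frac{1}{x^2 + 1}$
b) $f(x) = \sqrt{x + 2} - 1$
c) $f(x) = e^{5x+3}$
d) $f(x) = \log x$ if $x \geq 1$, $f(x) = -2x - 5$ if $x < 1$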


3. Find domain and range for the map $f(x) = \sqrt{\cos x - 1}$ and plot its graph.


4. Let $f(x) = -\log(x - 1)$; determine $f^{-1}([0, +\infty))$ and $f^{-1}((-\infty, -1])$.


5. Sketch the graph of the following functions indicating the possible symmetries 
and/or periodicity: 


a) $f(x) = \sqrt{1 - |x|}$     b) $f(x) = 1 + \cos 2x$
P(e 4 
c) fw) = tan (x + 5) d) riey= 4" oa as 
—x£ if 2 >1 
6. Using the map $f(x)$ in Fig. 2.26, draw the graphs of
$f(x) - 1$, $f(x + 3)$, $f(x - 1)$, $-f(x)$, $f(-x)$, $|f(x)|$.


7. Check that the function $f : \mathbb{R} \to \mathbb{R}$ defined by $f(x) = x^2 - 2x + 5$ is not
invertible. Determine suitable invertible restrictions of $f$ and write down the
inverses explicitly.


8. Determine the largest interval $I$ where the map
$f(x) = \sqrt{|x - 2| - |x| + 2}$
is invertible, and plot a graph. Write the expression of the inverse function of
$f$ restricted to $I$.


9. Verify that $f(x) = (1 + 3x)(2x - |x - 1|)$, defined on $[0, +\infty)$, is one-to-one.
Determine its range and inverse function.


10. Let f and g be the functions below. Write the expressions for go f, f og, and 
determine the composites’ domains. 


a) $f(x) = x^2 - 3$ and $g(x) = \log(1 + x)$


b) f()=— and g(t) = VI= 




Figure 2.26. Graph of the function f in Exercise 6 


11. Write $h(x) = \frac{2e^x + 1}{e^{2x} + 2}$ as composition of the map $f(x) = e^x$ with some other
function.


12. Given $f(x) = x^2 - 3x + 2$ and $g(x) = x^2 - 5x + 6$, find the expressions
and graphs of
$h(x) = \min(f(x), g(x))$ and $k(x) = \max(h(x), 0)$.


2.7.1 Solutions 


1. Domains: 

a) dom f = R \ {-3, 2}. 

b) The conditions $x^2 - 3x - 4 \geq 0$ and $x + 5 \neq 0$ are necessary. The first is
tantamount to $(x + 1)(x - 4) \geq 0$, hence $x \in (-\infty, -1] \cup [4, +\infty)$; the second
to $x \neq -5$. The domain of $f$ is then

$\mathrm{dom}\, f = (-\infty, -5) \cup (-5, -1] \cup [4, +\infty)$.

c) $\mathrm{dom}\, f = (-\infty, 0) \cup (1, +\infty)$.

d) In order to study the domain of this piecewise function, we treat the cases
$x \geq 0$, $x < 0$ separately.

For $x \geq 0$, we must impose $2x + 1 \neq 0$, i.e., $x \neq -\frac{1}{2}$. Since $-\frac{1}{2} < 0$, the function
is well defined on $x \geq 0$.

For $x < 0$, we must have $x + 1 \geq 0$, or $x \geq -1$. For negative $x$ then, the
function is defined on $[-1, 0)$.

All in all, $\mathrm{dom}\, f = [-1, +\infty)$.

2. Ranges: 

a) The map $y = x^2$ has range $[0, +\infty)$; therefore the range of $y = x^2 + 1$ is
$[1, +\infty)$. Passing to reciprocals, the given function ranges over $(0, 1]$.

b) The map is obtained by translating the elementary function $y = \sqrt{x}$ (whose
range is $[0, +\infty)$) to the left by 2 (yielding $y = \sqrt{x + 2}$) and then downwards
by 1 (which gives $y = \sqrt{x + 2} - 1$). The graph is visualised in Fig. 2.27, and
clearly $\mathrm{im}\, f = [-1, +\infty)$.

Alternatively, one can observe that $0 \leq \sqrt{x + 2} < +\infty$ implies $-1 \leq \sqrt{x + 2} -
1 < +\infty$, whence $\mathrm{im}\, f = [-1, +\infty)$.


Figure 2.27. Graph of $y = \sqrt{x + 2} - 1$


c) $\mathrm{im}\, f = (0, +\infty)$;     d) $\mathrm{im}\, f = (-7, +\infty)$.


3. Imposing $\cos x - 1 \geq 0$ tells that $\cos x \geq 1$. Such constraint is true only for
$x = 2k\pi$, $k \in \mathbb{Z}$, where the cosine equals 1; thus $\mathrm{dom}\, f = \{x \in \mathbb{R} : x = 2k\pi, k \in \mathbb{Z}\}$
and $\mathrm{im}\, f = \{0\}$. Fig. 2.28 provides the graph.


Figure 2.28. Graph of $y = \sqrt{\cos x - 1}$


4. $f^{-1}([0, +\infty)) = (1, 2]$ and $f^{-1}((-\infty, -1]) = [e + 1, +\infty)$.


5. Graphs and symmetries/periodicity: 


a) The function is even, not periodic and its graph is shown in Fig. 2.29 (top left).
b) The map is even and periodic of period $\pi$, with graph in Fig. 2.29 (top right).
c) This function is odd and periodic with period $\pi$, see Fig. 2.29 (bottom left).
d) The function has no symmetries nor a periodic behaviour, as shown in Fig. 2.29
(bottom right).


Figure 2.29. Graphs relative to Exercises 5.a) (top left), 5.b) (top right), 5.c) (bottom
left) and 5.d) (bottom right)


Figure 2.30. Graphs of Exercise 6: $f(x) - 1$, $f(x + 3)$, $f(x - 1)$ (top row); $-f(x)$, $f(-x)$, $|f(x)|$ (bottom row)


6. See Fig. 2.30. 


7. The function represents a parabola with vertex $(1, 4)$, and as such it is not
invertible on $\mathbb{R}$, not being one-to-one (e.g., $f(0) = f(2) = 5$). But restricted to the
intervals $(-\infty, 1]$ and $[1, +\infty)$ separately, it becomes invertible. Setting

$f_1 = f|_{(-\infty, 1]} : (-\infty, 1] \to [4, +\infty), \qquad f_2 = f|_{[1, +\infty)} : [1, +\infty) \to [4, +\infty),$

we can compute

$f_1^{-1} : [4, +\infty) \to (-\infty, 1], \qquad f_2^{-1} : [4, +\infty) \to [1, +\infty)$

explicitly. In fact, from $x^2 - 2x + 5 - y = 0$ we obtain

$x = 1 \pm \sqrt{y - 4}.$

With the ranges of $f_1^{-1}$ and $f_2^{-1}$ in mind, swapping the variables $x$, $y$ yields

$f_1^{-1}(x) = 1 - \sqrt{x - 4}, \qquad f_2^{-1}(x) = 1 + \sqrt{x - 4}.$


8. Since

$f(x) = 2$ if $x \leq 0$, \quad $f(x) = \sqrt{4 - 2x}$ if $0 < x \leq 2$, \quad $f(x) = 0$ if $x > 2$,

the required interval $I$ is $[0, 2]$, and the graph of $f$ is shown in Fig. 2.31.
In addition $f([0, 2]) = [0, 2]$, so $f^{-1} : [0, 2] \to [0, 2]$. By putting $y = \sqrt{4 - 2x}$ we
obtain $x = \frac{4 - y^2}{2}$, which implies $f^{-1}(x) = 2 - \frac{1}{2}x^2$.


Figure 2.31. Graph of $y = \sqrt{|x - 2| - |x| + 2}$


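9. We have

$f(x) = 9x^2 - 1$ if $0 \leq x \leq 1$, \quad $f(x) = 3x^2 + 4x + 1$ if $x > 1$,

and the graph of $f$ is in Fig. 2.32.

The range of $f$ is $[-1, +\infty)$. To determine $f^{-1}$ we discuss the cases $0 \leq x \leq 1$ and
$x > 1$ separately. For $0 \leq x \leq 1$, we have $-1 \leq y \leq 8$ and

$y = 9x^2 - 1 \iff x = \frac{\sqrt{y + 1}}{3}.$

For $x > 1$, we have $y > 8$ and

$y = 3x^2 + 4x + 1 \iff x = \frac{-2 + \sqrt{3y + 1}}{3}.$


Figure 2.32. Graph of $y = (1 + 3x)(2x - |x - 1|)$


Thus

$f^{-1}(x) = \frac{\sqrt{x + 1}}{3}$ if $-1 \leq x \leq 8$, \quad $f^{-1}(x) = \frac{-2 + \sqrt{3x + 1}}{3}$ if $x > 8$.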


10. Composite functions: 


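a) As $g \circ f(x) = g(f(x)) = g(x^2 - 3) = \log(1 + x^2 - 3) = \log(x^2 - 2)$, it follows
$\mathrm{dom}\, g \circ f = \{x \in \mathbb{R} : x^2 - 2 > 0\} = (-\infty, -\sqrt{2}) \cup (\sqrt{2}, +\infty)$.
We have $f \circ g(x) = f(g(x)) = f(\log(1 + x)) = (\log(1 + x))^2 - 3$, so
$\mathrm{dom}\, f \circ g = \{x \in \mathbb{R} : 1 + x > 0\} = (-1, +\infty)$.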
bh ese) = a nd domgo f = (1, 2]; 
7 2g):= SS and dom f o g = (—on, 2]. 
11. $g(x) = \frac{2x + 1}{x^2 + 2}$ and $h(x) = g \circ f(x)$.


12. After drawing the parabolic graphs $f(x)$ and $g(x)$ (Fig. 2.33), one sees that

$h(x) = x^2 - 3x + 2$ if $x \leq 2$, \quad $h(x) = x^2 - 5x + 6$ if $x > 2$,

and the graph of $h$ is that of Fig. 2.34, left.


Figure 2.33. Graphs of the parabolas $f(x) = x^2 - 3x + 2$ and $g(x) = x^2 - 5x + 6$


Figure 2.34. Graphs of the maps $h$ (left) and $k$ (right) relative to Exercise 12


Proceeding as above,

$k(x) = x^2 - 3x + 2$ if $x \leq 1$, \quad $k(x) = 0$ if $1 < x < 3$, \quad $k(x) = x^2 - 5x + 6$ if $x \geq 3$,

and $k$ has a graph as in Fig. 2.34, right.
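
The piecewise expressions found in Exercise 12 can be cross-checked against the direct definitions through `min` and `max`. The following sketch compares the two on a grid of sample points; the names `h_piecewise`, `k_piecewise` and the grid are hypothetical choices made for the check.

```python
def f(x):
    return x * x - 3 * x + 2

def g(x):
    return x * x - 5 * x + 6

def h(x):
    """h = min(f, g), straight from the definition."""
    return min(f(x), g(x))

def k(x):
    """k = max(h, 0), straight from the definition."""
    return max(h(x), 0)

def h_piecewise(x):
    # f lies below g exactly when f(x) - g(x) = 2x - 4 <= 0, i.e. x <= 2
    return f(x) if x <= 2 else g(x)

def k_piecewise(x):
    if x <= 1:
        return f(x)
    if x < 3:
        return 0        # h is negative on (1, 2) and (2, 3), and zero at 2
    return g(x)

grid = [i / 10 for i in range(-50, 101)]    # sample points from -5 to 10
print(all(h(x) == h_piecewise(x) for x in grid))
print(all(k(x) == k_piecewise(x) for x in grid))
```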


3 Limits and continuity I


This chapter tackles the limit behaviour of a real sequence or a function of one 
real variable, and studies the continuity of such a function. 


3.1 Neighbourhoods 


The process of defining limits and continuity leads to consider real numbers which 
are ‘close’ to a certain real number. In equivalent geometrical jargon, one considers 
points on the real line ‘in the proximity’ of a given point. Let us begin by making 
mathematical sense of the notion of neighbourhood of a point. 


Definition 3.1 Let $x_0 \in \mathbb{R}$ be a point on the real line, and $r > 0$ a real
number. We call neighbourhood of $x_0$ of radius $r$ the open and bounded
interval

$I_r(x_0) = (x_0 - r, x_0 + r) = \{x \in \mathbb{R} : |x - x_0| < r\}.$


Hence, the neighbourhood of 2 of radius $10^{-1}$, denoted $I_{10^{-1}}(2)$, is the set of real
numbers lying between 1.9 and 2.1, these excluded. By understanding the quantity
$|x - x_0|$ as the Euclidean distance between the points $x_0$ and $x$, we can then say
that $I_r(x_0)$ consists of the points on the real line whose distance from $x_0$ is less
than $r$. If we interpret $|x - x_0|$ as the tolerance in the approximation of $x_0$ by
$x$, then $I_r(x_0)$ becomes the set of real numbers approximating $x_0$ with a better
margin of precision than $r$.
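
Membership in a neighbourhood amounts to the single inequality $|x - x_0| < r$. A minimal sketch follows (the function name is hypothetical); exact rational arithmetic is used in the example so that the excluded end-points 1.9 and 2.1 are not blurred by floating-point rounding.

```python
from fractions import Fraction

def in_neighbourhood(x, x0, r):
    """True when x lies in I_r(x0) = (x0 - r, x0 + r), i.e. |x - x0| < r."""
    if r <= 0:
        raise ValueError("the radius r must be positive")
    return abs(x - x0) < r

r = Fraction(1, 10)                                   # radius 10^(-1)
print(in_neighbourhood(Fraction(205, 100), 2, r))     # 2.05: inside I_r(2)
print(in_neighbourhood(Fraction(19, 10), 2, r))       # 1.9: end-point, excluded
print(in_neighbourhood(Fraction(21, 10), 2, r))       # 2.1: end-point, excluded
```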


Figure 3.1. Neighbourhood of $x_0$ of radius $r$


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_3, 
© Springer International Publishing Switzerland 2015 


Varying $r$ in the set of positive real numbers, while maintaining $x_0$ in $\mathbb{R}$ fixed,
we obtain a family of neighbourhoods of $x_0$. Each neighbourhood is a proper
subset of any other in the family that has bigger radius, and in turn it contains
all neighbourhoods of lesser radius.


Remark 3.2 The notion of neighbourhood of a point $x_0 \in \mathbb{R}$ is nothing but a
particular case of the analogue for a point in the Cartesian product $\mathbb{R}^d$ (hence the
plane if $d = 2$, space if $d = 3$), presented in Definition 8.11.

The upcoming definitions of limit and continuity, based on the idea of neigh-
bourhood, can be stated directly for functions on $\mathbb{R}^d$, by considering functions of
one real variable as subcases for $d = 1$. We prefer to follow a more gradual ap-
proach, so we shall examine first the one-dimensional case. Sect. 8.5 will be devoted
to explaining how all this generalises to several dimensions.


It is also convenient to include the case where $x_0$ is one of the points $+\infty$ or $-\infty$.


Definition 3.3 For any real $a \geq 0$, we call neighbourhood of $+\infty$ with
end-point $a$ the open, unbounded interval

$I_a(+\infty) = (a, +\infty).$

Similarly, a neighbourhood of $-\infty$ with end-point $-a$ will be defined as

$I_a(-\infty) = (-\infty, -a).$


Figure 3.2. Neighbourhoods of $-\infty$ (left) and $+\infty$ (right)


The following convention will be useful in the sequel. We shall say that the
property $P(x)$ holds 'in a neighbourhood' of a point $c$ ($c$ being a real number $x_0$ or
$+\infty$, $-\infty$) if there is a certain neighbourhood of $c$ such that for each of its points
$x$, $P(x)$ holds. Colloquially, one also says '$P(x)$ holds around $c$', especially when
the neighbourhood need not be specified. For example, the map $f(x) = 2x - 1$
is positive in a neighbourhood of $x_0 = 1$; in fact, $f(x) > 0$ for any $x \in I_{1/2}(1)$.


3.2 Limit of a sequence 


Consider a real sequence $a : n \mapsto a_n$. We are interested in studying the behaviour of
the values $a_n$ as $n$ increases, and we do so by looking first at a couple of examples.


Examples 3.4

i) Let $a_n = \frac{n}{n+1}$. The first terms of this sequence are presented in Table 3.1. We
see that the values approach 1 as $n$ increases. More precisely, the real number 1
can be approximated as well as we like by $a_n$ for $n$ sufficiently large. This clause
is to be understood in the following sense: however small we fix $\varepsilon > 0$, from a
certain point $n_\varepsilon$ onwards all values $a_n$ approximate 1 with a margin smaller than
$\varepsilon$.

The condition $|a_n - 1| < \varepsilon$, in fact, is tantamount to $\frac{1}{n+1} < \varepsilon$, i.e., $n + 1 > \frac{1}{\varepsilon}$;
thus defining $n_\varepsilon = \left[\frac{1}{\varepsilon}\right]$ and taking any natural number $n > n_\varepsilon$, we have
$n + 1 > \left[\frac{1}{\varepsilon}\right] + 1 > \frac{1}{\varepsilon}$, hence $|a_n - 1| < \varepsilon$. In other words, for every $\varepsilon > 0$, there exists an
$n_\varepsilon$ such that

$n > n_\varepsilon \implies |a_n - 1| < \varepsilon.$

Looking at the graph of the sequence (Fig. 3.3), one can say that for all $n > n_\varepsilon$
the points $(n, a_n)$ of the graph lie between the horizontal lines $y = 1 - \varepsilon$ and
$y = 1 + \varepsilon$.

        n   $a_n = \frac{n}{n+1}$
        0   0.00000000000000
        1   0.50000000000000
        2   0.66666666666667
        3   0.75000000000000
        4   0.80000000000000
        5   0.83333333333333
        6   0.85714285714286
        7   0.87500000000000
        8   0.88888888888889
        9   0.90000000000000
       10   0.90909090909090
      100   0.99009900990099
     1000   0.99900099900100
    10000   0.99990000999900
   100000   0.99999000010000
  1000000   0.99999900000100
 10000000   0.99999990000001
100000000   0.99999999000000

        n   $a_n = (1 + \frac{1}{n})^n$
        1   2.0000000000000
        2   2.2500000000000
        3   2.3703703703704
        4   2.4414062500000
        5   2.4883200000000
        6   2.5216263717421
        7   2.5464996970407
        8   2.5657845139503
        9   2.5811747917132
       10   2.5937424601000
      100   2.7048138294215
     1000   2.7169239322355
    10000   2.7181459268244
   100000   2.7182682371975
  1000000   2.7182804691564
 10000000   2.7182816939804
100000000   2.7182817863958


Table 3.1. Values, estimated to the 14th digit, of the sequences $a_n = \frac{n}{n+1}$ (first block) and
$a_n = (1 + \frac{1}{n})^n$ (second block)


Figure 3.3. Convergence of the sequence $a_n = \frac{n}{n+1}$

ii) The first values of the sequence $a_n = \left(1 + \frac{1}{n}\right)^n$ are shown in Table 3.1. One
could imagine, even expect, that as $n$ increases the values $a_n$ get closer to a
certain real number, whose decimal expansion starts as 2.718... This is actually
the case, and we shall return to this important example later.


We introduce the notion of converging sequence. For simplicity we shall assume
the sequence is defined on the set $\{n \in \mathbb{N} : n \geq n_0\}$ for a suitable $n_0 \geq 0$.


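Definition 3.5 A sequence $a : n \mapsto a_n$ converges to the limit $\ell \in \mathbb{R}$ (or
converges to $\ell$ or has limit $\ell$), in symbols

$\lim_{n \to \infty} a_n = \ell,$

if for any real number $\varepsilon > 0$ there exists an integer $n_\varepsilon$ such that

$\forall n \geq n_0, \quad n > n_\varepsilon \implies |a_n - \ell| < \varepsilon.$


Using the language of neighbourhoods, the condition $n > n_\varepsilon$ can be written $n \in
I_{n_\varepsilon}(+\infty)$, while $|a_n - \ell| < \varepsilon$ becomes $a_n \in I_\varepsilon(\ell)$. Therefore, the definition of
convergence to a limit is equivalent to: for any neighbourhood $I_\varepsilon(\ell)$ of $\ell$, there
exists a neighbourhood $I_{n_\varepsilon}(+\infty)$ of $+\infty$ such that

$\forall n \geq n_0, \quad n \in I_{n_\varepsilon}(+\infty) \implies a_n \in I_\varepsilon(\ell).$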


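Examples 3.6

i) Referring to Example 3.4 i), we can say

$\lim_{n \to \infty} \frac{n}{n+1} = 1.$

ii) Let us check that

$\lim_{n \to \infty} \frac{3n}{2 + 5n^2} = 0.$

Given $\varepsilon > 0$, we must show

$\left| \frac{3n}{2 + 5n^2} \right| < \varepsilon$

for all $n$ greater than a suitable natural number $n_\varepsilon$. Observing that for $n \geq 1$

$\left| \frac{3n}{2 + 5n^2} \right| = \frac{3n}{2 + 5n^2} < \frac{3n}{5n^2} = \frac{3}{5n},$

we have

$\frac{3}{5n} < \varepsilon \implies \left| \frac{3n}{2 + 5n^2} \right| < \varepsilon.$

But

$\frac{3}{5n} < \varepsilon \iff n > \frac{3}{5\varepsilon},$

so we can set $n_\varepsilon = \left[\frac{3}{5\varepsilon}\right]$.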


Let us examine now a different behaviour as $n$ increases. Consider for
instance the sequence

$a : n \mapsto a_n = n^2.$

Its first few values are written in Table 3.2. Not only the values seem not to
converge to any finite limit $\ell$, they are not even bounded from above: however
large we choose a real number $A > 0$, if $n$ is big enough (meaning
larger than a suitable $n_A$), $a_n$ will be bigger than $A$. In fact, it is sufficient
to choose $n_A = [\sqrt{A}]$ and note

$n > n_A \implies n > \sqrt{A} \implies n^2 > A.$

One says that the sequence diverges to $+\infty$ when that happens.

     n   $a_n$
     0   0
     1   1
     2   4
     3   9
     4   16
     5   25
     6   36
     7   49
     8   64
     9   81
    10   100
   100   10000
  1000   1000000
 10000   100000000
100000   10000000000

Table 3.2. Values of $a_n = n^2$

In general the notion of divergent sequence is defined as follows. 


Definition 3.7 The sequence $a : n \mapsto a_n$ tends to $+\infty$ (or diverges to
$+\infty$, or has limit $+\infty$), written

$\lim_{n \to \infty} a_n = +\infty,$

if for any real $A > 0$ there exists an $n_A$ such that

$\forall n \geq n_0, \quad n > n_A \implies a_n > A.$    (3.1)


Using neighbourhoods, one can also say that for any neighbourhood $I_A(+\infty)$ of
$+\infty$, there is a neighbourhood $I_{n_A}(+\infty)$ of $+\infty$ satisfying

$\forall n \geq n_0, \quad n \in I_{n_A}(+\infty) \implies a_n \in I_A(+\infty).$


The definition of

$\lim_{n \to \infty} a_n = -\infty$

is completely analogous, with the proviso that the implication of (3.1) is changed
to

$\forall n \geq n_0, \quad n > n_A \implies a_n < -A.$

Examples 3.8

i) From what we have seen it is clear that

$\lim_{n \to \infty} n^2 = +\infty.$

ii) The sequence $a_n = 0 + 1 + 2 + \ldots + n = \sum_{k=0}^{n} k$ associates to $n$ the sum of the
natural numbers up to $n$. To determine the limit we show first of all that

$\sum_{k=0}^{n} k = \frac{n(n+1)}{2},$    (3.2)

a relation with several uses in Mathematics. For that, note that $a_n$ can also be
written as $a_n = n + (n - 1) + \ldots + 2 + 1 + 0 = \sum_{k=0}^{n} (n - k)$, hence

$2a_n = \sum_{k=0}^{n} k + \sum_{k=0}^{n} (n - k) = \sum_{k=0}^{n} n = n \sum_{k=0}^{n} 1 = n(n + 1),$

and the claim follows. Let us verify $\lim_{n \to \infty} \frac{n(n+1)}{2} = +\infty$. Since $\frac{n(n+1)}{2} > \frac{n^2}{2}$,
we can proceed as in the example above, so for a given $A > 0$, we may choose

$n_A = [\sqrt{2A}].$

The previous examples show that some sequences are convergent, others di-
vergent (to $+\infty$ or $-\infty$). But if neither of these cases occurs, one says that the
sequence is indeterminate. Such are for instance the sequence $a_n = (-1)^n$, which
we have already met, or

$a_n = (1 + (-1)^n)\, n$, that is, $a_n = 2n$ for $n$ even and $a_n = 0$ for $n$ odd.


A sufficient condition to avoid an indeterminate behaviour is monotonicity. 
The definitions concerning monotone functions, given in Sect.2.4, apply to se- 
quences, as well, which are nothing but particular functions defined over the nat- 
ural numbers. For them they become particularly simple: it will be enough to 
compare the values for all pairs of subscripts n, n + 1 belonging to the domain of 
the sequence. So, a sequence is monotone increasing if 


$\forall n \geq n_0, \quad a_n \leq a_{n+1},$


the other definitions being analogous. The following result holds. 


Theorem 3.9 A monotone sequence $a : n \mapsto a_n$ is either convergent or
divergent. Precisely, in case $a_n$ is increasing:

i) if the sequence is bounded from above, i.e., there is an upper bound $b \in \mathbb{R}$
such that $a_n \leq b$ for all $n \geq n_0$, then the sequence converges to the
supremum $\ell$ of its image:

$\lim_{n \to \infty} a_n = \ell = \sup\{a_n : n \geq n_0\};$

ii) if the sequence is not bounded from above, then it diverges to $+\infty$.

In case the sequence is decreasing, the assertions modify in the obvious way.


Proof. Assume first that $\{a_n\}$ is bounded from above, which is to say that $\ell =
\sup\{a_n : n \geq n_0\} \in \mathbb{R}$. Due to conditions (1.7), for any $\varepsilon > 0$ there exists
an element $a_{n_\varepsilon}$ such that $\ell - \varepsilon < a_{n_\varepsilon} \leq \ell$. As the sequence is monotone,
$a_{n_\varepsilon} \leq a_n$, $\forall n \geq n_\varepsilon$; moreover, $a_n \leq \ell$, $\forall n \geq n_0$ by definition of the
supremum. Therefore

$\ell - \varepsilon < a_n \leq \ell < \ell + \varepsilon, \quad \forall n \geq n_\varepsilon,$

hence each term $a_n$ with $n \geq n_\varepsilon$ belongs to the neighbourhood of $\ell$ of
radius $\varepsilon$. But this is precisely the meaning of

$\lim_{n \to \infty} a_n = \ell.$

Let now $\ell = +\infty$. Put differently, for any $A > 0$ there exists an element
$a_{n_A}$ so that $a_{n_A} > A$. Monotonicity implies $a_n \geq a_{n_A} > A$, $\forall n \geq n_A$. Thus
every $a_n$ with $n \geq n_A$ belongs to the neighbourhood $I_A(+\infty) = (A, +\infty)$
of $+\infty$, i.e.,

$\lim_{n \to \infty} a_n = +\infty.$

Example 3.10

Let us go back to Example 3.4 i). The sequence $a_n = \frac{n}{n+1}$ is strictly increasing,
for $a_n < a_{n+1}$, i.e., $\frac{n}{n+1} < \frac{n+1}{n+2}$, is equivalent to $n(n + 2) < (n + 1)^2$, hence
$n^2 + 2n < n^2 + 2n + 1$, which is valid for any $n$.

Moreover, $a_n < 1$ for all $n \geq 0$; actually, 1 is the supremum of the set $\{a_n : n \in
\mathbb{N}\}$, as remarked in Sect. 1.3.1. Theorem 3.9 recovers the already known result

$\lim_{n \to \infty} a_n = 1.$


The number e

Consider the sequence $a_n = \left(1 + \frac{1}{n}\right)^n$ introduced in Example 3.4 ii). It is possible
to prove that it is a strictly increasing sequence (hence in particular $a_n > 2 = a_1$ for
any $n > 1$) and that it is bounded from above ($a_n < 3$ for all $n$). Thus Theorem 3.9
ensures that the sequence converges to a limit between 2 and 3, which traditionally
is indicated by the symbol e:

$\lim_{n \to \infty} \left(1 + \frac{1}{n}\right)^n = e.$    (3.3)

This number, sometimes called Napier's number or Euler's number, plays a
role of the foremost importance in Mathematics. It is an irrational number, whose
first decimal digits are

$e = 2.71828182845905 \cdots$

Proofs of the stated properties are given in Appendix A.2.3, p. 437.

The number e is one of the most popular bases for exponentials and logarithms.
The exponential function $y = e^x$ shall sometimes be denoted by $y = \exp x$. The
logarithm in base e is called natural logarithm and denoted by log or ln, instead
of $\log_e$ (for the base-10 logarithm, or decimal logarithm, one uses the capitalised
symbol Log).

3.3 Limits of functions; continuity 


Let $f$ be a real function of real variable. We wish to describe the behaviour of
the dependent variable $y = f(x)$ when the independent variable $x$ 'approaches' a
certain point $x_0 \in \mathbb{R}$, or one of the points at infinity $-\infty$, $+\infty$. We start with the
latter case for convenience, because we have already studied what sequences do at
infinity.


3.3 Limits of functions; continuity 73 
3.3.1 Limits at infinity 


Suppose f is defined around +o. In analogy to sequences we have some definitions. 


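Definition 3.11 The function $f$ tends to the limit $\ell \in \mathbb{R}$ for $x$ going to $+\infty$, in symbols

$$\lim_{x\to+\infty} f(x) = \ell,$$

if for any real number $\varepsilon > 0$ there is a real $B \ge 0$ such that

$$\forall x \in \operatorname{dom} f, \qquad x > B \ \Rightarrow\ |f(x) - \ell| < \varepsilon. \tag{3.4}$$

This condition requires that for any neighbourhood $I_\varepsilon(\ell)$ of $\ell$, there exists a neighbourhood $I_B(+\infty)$ of $+\infty$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I_B(+\infty) \ \Rightarrow\ f(x) \in I_\varepsilon(\ell).$$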


Definition 3.12 The function $f$ tends to $+\infty$ for $x$ going to $+\infty$, in symbols

$$\lim_{x\to+\infty} f(x) = +\infty,$$

if for each real $A > 0$ there is a real $B \ge 0$ such that

$$\forall x \in \operatorname{dom} f, \qquad x > B \ \Rightarrow\ f(x) > A. \tag{3.5}$$

For functions tending to $-\infty$ one should replace $f(x) > A$ by $f(x) < -A$. The expression

$$\lim_{x\to+\infty} f(x) = \infty$$

means $\lim_{x\to+\infty} |f(x)| = +\infty$.

If $f$ is defined around $-\infty$, Definitions 3.11 and 3.12 modify to become definitions of limit ($L$, finite or infinite) for $x$ going to $-\infty$, by changing $x > B$ to $x < -B$:

$$\lim_{x\to-\infty} f(x) = L.$$

At last, by

$$\lim_{x\to\infty} f(x) = L$$

one intends that $f$ has limit $L$ (finite or not) both for $x \to +\infty$ and $x \to -\infty$.




Examples 3.13 
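i) Let us check that

$$\lim_{x\to+\infty} \frac{x^2 + 2x}{2x^2 + 1} = \frac{1}{2}.$$

Given $\varepsilon > 0$, the condition $|f(x) - \frac12| < \varepsilon$ is equivalent to

$$\left| \frac{4x - 1}{2(2x^2 + 1)} \right| < \varepsilon.$$

Without loss of generality we assume $x > \frac14$, so that the absolute value sign can be removed. Using simple properties of fractions

$$\frac{4x - 1}{2(2x^2 + 1)} < \frac{2x}{2x^2 + 1} < \frac{2x}{2x^2} = \frac{1}{x} < \varepsilon \quad\Longleftarrow\quad x > \frac{1}{\varepsilon}.$$

Thus (3.4) holds for $B = \max\left(\frac14, \frac1\varepsilon\right)$.

ii) We prove

$$\lim_{x\to+\infty} \sqrt{x} = +\infty.$$

Let $A > 0$ be fixed. Since $\sqrt{x} > A$ is equivalent to $x > A^2$, putting $B = A^2$ fulfills (3.5).

iii) Consider

$$\lim_{x\to-\infty} \frac{1}{\sqrt{1 - x}} = 0.$$

With $\varepsilon > 0$ fixed,

$$\left| \frac{1}{\sqrt{1 - x}} \right| = \frac{1}{\sqrt{1 - x}} < \varepsilon$$

is tantamount to $\sqrt{1 - x} > \frac{1}{\varepsilon}$, that is $1 - x > \frac{1}{\varepsilon^2}$, or $x < 1 - \frac{1}{\varepsilon^2}$. So taking $B = \max\left(0, \frac{1}{\varepsilon^2} - 1\right)$, we have

$$x < -B \ \Rightarrow\ \left| \frac{1}{\sqrt{1 - x}} \right| < \varepsilon.$$
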
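
The choice of $B$ in Example 3.13 i) can be tested on sample points. The sketch below is a numerical spot check under stated assumptions, not a substitute for the proof: the helper names `f`, `B_for` and `check`, and the particular sample points, are hypothetical choices made for this illustration.

```python
def f(x: float) -> float:
    """The function of Example 3.13 i)."""
    return (x * x + 2 * x) / (2 * x * x + 1)


def B_for(eps: float) -> float:
    """Threshold used in the example: B = max(1/4, 1/eps)."""
    return max(0.25, 1.0 / eps)


def check(eps: float, samples: int = 1000) -> bool:
    """Test |f(x) - 1/2| < eps at finitely many points x > B."""
    B = B_for(eps)
    # Sample points B*1.001, B*1.501, B*2.001, ... all lie strictly beyond B.
    xs = [B * (1.001 + 0.5 * k) for k in range(samples)]
    return all(abs(f(x) - 0.5) < eps for x in xs)


for eps in (1.0, 0.1, 1e-3, 1e-6):
    print(eps, B_for(eps), check(eps))
```

The definition only asks for some admissible $B$, so a larger threshold would pass the same check; the value used here is simply the one produced by the chain of inequalities in the example.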
3.3.2 Continuity. Limits at real points 


We now investigate the behaviour of the values $y = f(x)$ of a function $f$ when $x$ 'approaches' a point $x_0 \in \mathbb{R}$. Suppose $f$ is defined in a neighbourhood of $x_0$, but not necessarily at the point $x_0$ itself. Two examples will let us capture the essence of the notions of continuity and finite limit. Fix $x_0 = 0$ and consider the real functions of real variable $f(x) = x^3 + 1$, $g(x) = x + [1 - x^2]$ and $h(x) = \frac{\sin x}{x}$ (recall that $[z]$ indicates the integer part of $z$); their respective graphs, at least in a neighbourhood of the origin, are presented in Fig. 3.4 and 3.5.


As far as $g$ is concerned, we observe that $|x| < 1$ implies $0 < 1 - x^2 \le 1$, and the value 1 is attained only at $x = 0$; in the neighbourhood of the origin of unit radius then,

$$g(x) = \begin{cases} 1 & \text{if } x = 0, \\ x & \text{if } x \ne 0, \end{cases}$$

as the picture shows. Note the function $h$ is not defined at the origin.

Figure 3.4. Graphs of $f(x) = x^3 + 1$ (left) and $g(x) = x + [1 - x^2]$ (right), in a neighbourhood of the origin

For each of $f$ and $g$, let us compare the values at points $x$ near the origin with the actual value at the origin. The two functions behave rather differently. The value $f(0) = 1$ can be approximated as well as we like by any $f(x)$, provided $x$ is close enough to 0. Precisely, having fixed an (arbitrarily small) 'error' $\varepsilon > 0$ in advance, we can make $|f(x) - f(0)|$ smaller than $\varepsilon$ for all $x$ such that $|x - 0| = |x|$ is smaller than a suitable real $\delta > 0$. In fact $|f(x) - f(0)| = |x^3| = |x|^3 < \varepsilon$ means $|x| < \sqrt[3]{\varepsilon}$, so it is sufficient to choose $\delta = \sqrt[3]{\varepsilon}$. We shall say that the function $f$ is continuous at the origin.


Figure 3.5. Graph of $h(x) = \frac{\sin x}{x}$ around the origin


On the other hand, $g(0) = 1$ cannot be approximated well by any $g(x)$ with $x$ close to 0. For instance, let $\varepsilon = \frac15$. Then $|g(x) - g(0)| < \varepsilon$ is equivalent to $\frac45 < g(x) < \frac65$; but all $x$ different from 0 and such that, say, $|x| < \frac12$, satisfy $-\frac12 < g(x) = x < \frac12$, in violation of the constraint for $g(x)$. The function $g$ is not continuous at the origin.

At any rate, we can specify the behaviour of $g$ around 0: for $x$ closer and closer to 0, yet different from 0, the images $g(x)$ approximate not the value $g(0)$, but rather $\ell = 0$. In fact, with $\varepsilon > 0$ fixed, if $x \ne 0$ satisfies $|x| < \min(\varepsilon, 1)$, then $g(x) = x$ and $|g(x) - \ell| = |g(x)| = |x| < \varepsilon$. We say that $g$ has limit 0 for $x$ going to 0.

As for the function $h$, it cannot be continuous at the origin, since comparing the values $h(x)$, for $x$ near 0, with the value at the origin simply makes no sense, for the latter is not even defined. Nevertheless, the graph allows us to 'conjecture' that these values might estimate $\ell = 1$ increasingly better, the closer we choose $x$ to the origin. We are led to say $h$ has a limit for $x$ going to 0, and this limit is 1. We shall substantiate this claim later on.

The examples just seen introduce us to the definition of continuity and of 
(finite) limit. 


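Definition 3.14 Let $x_0$ be a point in the domain of a function $f$. This function is called continuous at $x_0$ if for any $\varepsilon > 0$ there is a $\delta > 0$ such that

$$\forall x \in \operatorname{dom} f, \qquad |x - x_0| < \delta \ \Rightarrow\ |f(x) - f(x_0)| < \varepsilon. \tag{3.6}$$

In neighbourhood-talk: for any neighbourhood $I_\varepsilon(f(x_0))$ of $f(x_0)$ there exists a neighbourhood $I_\delta(x_0)$ of $x_0$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I_\delta(x_0) \ \Rightarrow\ f(x) \in I_\varepsilon(f(x_0)). \tag{3.7}$$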


Definition 3.15 Let $f$ be a function defined on a neighbourhood of $x_0 \in \mathbb{R}$, except possibly at $x_0$. Then $f$ has limit $\ell \in \mathbb{R}$ (or tends to $\ell$ or converges to $\ell$) for $x$ approaching $x_0$, written

$$\lim_{x\to x_0} f(x) = \ell,$$

if given any $\varepsilon > 0$ there exists a $\delta > 0$ such that

$$\forall x \in \operatorname{dom} f, \qquad 0 < |x - x_0| < \delta \ \Rightarrow\ |f(x) - \ell| < \varepsilon. \tag{3.8}$$

Alternatively: for any given neighbourhood $I_\varepsilon(\ell)$ of $\ell$ there is a neighbourhood $I_\delta(x_0)$ of $x_0$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I_\delta(x_0) \setminus \{x_0\} \ \Rightarrow\ f(x) \in I_\varepsilon(\ell).$$

The definition of limit is represented in Fig. 3.6.

Figure 3.6. Definition of finite limit of a function


Let us compare the notions just seen. To have continuity one looks at the values $f(x)$ from the point of view of $f(x_0)$, whereas for limits these $f(x)$ are compared to $\ell$, which could be different from $f(x_0)$, provided $f$ is defined in $x_0$. To test the limit, moreover, the comparison with $x = x_0$ is excluded: requiring $0 < |x - x_0|$ means exactly $x \ne x_0$; on the contrary, the implication (3.6) is obviously true for $x = x_0$.

Let $f$ be defined in a neighbourhood of $x_0$. If $f$ is continuous at $x_0$, then (3.8) is certainly true with $\ell = f(x_0)$; vice versa if $f$ has limit $\ell = f(x_0)$ for $x$ going to $x_0$, then (3.6) holds. Thus the continuity of $f$ at $x_0$ is tantamount to

$$\lim_{x\to x_0} f(x) = f(x_0). \tag{3.9}$$


In both definitions, after fixing an arbitrary $\varepsilon > 0$, one is asked to find at least one positive number $\delta$ ('there is a $\delta$') for which (3.6) or (3.8) holds. If either implication holds for a certain $\delta$, it will also hold for every $\delta' < \delta$. The definition does not require finding the biggest possible $\delta$ satisfying the implication. With this firmly in mind, testing continuity or verifying a limit can become much simpler.


Returning to the functions $f$, $g$, $h$ of the beginning, we can now say that $f$ is continuous at $x_0 = 0$,

$$\lim_{x\to0} f(x) = 1 = f(0),$$

whereas $g$, despite having limit 0 for $x \to 0$, is not continuous:

$$\lim_{x\to0} g(x) = 0 \ne g(0).$$

We shall prove in Example 4.6 i) that $h$ admits a limit for $x$ going to 0, and actually

$$\lim_{x\to0} h(x) = 1.$$

The functions $g$ and $h$ suggest the following definition.


Definition 3.16 Let $f$ be defined on a neighbourhood of $x_0$, excluding the point $x_0$. If $f$ admits limit $\ell \in \mathbb{R}$ for $x$ approaching $x_0$, and if a) $f$ is defined in $x_0$ but $f(x_0) \ne \ell$, or b) $f$ is not defined in $x_0$, then we say $x_0$ is a (point of) removable discontinuity for $f$.


The choice of terminology is justified by the fact that one can modify the function at $x_0$, or define it there, so as to obtain a continuous map at $x_0$. More precisely, the function

$$\tilde f(x) = \begin{cases} f(x) & \text{if } x \ne x_0, \\ \ell & \text{if } x = x_0, \end{cases}$$

is such that

$$\lim_{x\to x_0} \tilde f(x) = \lim_{x\to x_0} f(x) = \ell = \tilde f(x_0),$$

hence it is continuous at $x_0$.

For the above functions we have $\tilde g(x) = x$ in a neighbourhood of the origin, while

$$\tilde h(x) = \begin{cases} \dfrac{\sin x}{x} & \text{if } x \ne 0, \\ 1 & \text{if } x = 0. \end{cases}$$

In the latter case, we have defined the continuous prolongation of $y = \frac{\sin x}{x}$ by assigning the value that renders it continuous at the origin. From now on when referring to the function $y = \frac{\sin x}{x}$, we will always understand it as continuously prolonged in the origin.
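
The continuous prolongation of $y = \frac{\sin x}{x}$ translates directly into code. The following Python sketch is an illustration only; the function names `h` and `h_tilde` are chosen here to mirror the notation of the text, and the printed values merely suggest, rather than prove, that the values approach 1.

```python
import math


def h(x: float) -> float:
    """sin(x)/x, not defined at x = 0 (Python raises ZeroDivisionError there)."""
    return math.sin(x) / x


def h_tilde(x: float) -> float:
    """Continuous prolongation: equals h for x != 0 and takes the limit value 1 at 0."""
    return 1.0 if x == 0 else math.sin(x) / x


# Values near the origin approach the value assigned at the origin.
for x in (0.1, 0.01, 0.001, 0.0):
    print(x, h_tilde(x))
```

Only the value at the single point $x_0 = 0$ is prescribed by hand; everywhere else `h_tilde` coincides with `h`, exactly as in the definition of $\tilde f$.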


Examples 3.17 


We show that the main elementary functions are continuous. 


i) Let $f : \mathbb{R} \to \mathbb{R}$, $f(x) = ax + b$ and $x_0 \in \mathbb{R}$ be given. For any $\varepsilon > 0$, $|f(x) - f(x_0)| < \varepsilon$ if and only if $|a|\,|x - x_0| < \varepsilon$. When $a = 0$, the condition holds for any $x \in \mathbb{R}$; if $a \ne 0$ instead, it is equivalent to $|x - x_0| < \frac{\varepsilon}{|a|}$, and we can put $\delta = \frac{\varepsilon}{|a|}$ in (3.6). The map $f$ is thus continuous at every $x_0 \in \mathbb{R}$.

ii) The function $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x^2$ is continuous at $x_0 = 2$. We shall prove this fact in two different ways. Given $\varepsilon > 0$, $|f(x) - f(2)| < \varepsilon$, or $|x^2 - 4| < \varepsilon$, means

$$4 - \varepsilon < x^2 < 4 + \varepsilon. \tag{3.10}$$

We can suppose $\varepsilon \le 4$ (for if $|f(x) - f(2)| < \varepsilon$ for a certain $\varepsilon$, the same will be true for all $\varepsilon' > \varepsilon$); as we are looking for $x$ in a neighbourhood of 2, we can furthermore assume $x > 0$. Under such assumptions (3.10) yields

$$\sqrt{4 - \varepsilon} < x < \sqrt{4 + \varepsilon},$$

hence

$$-(2 - \sqrt{4 - \varepsilon}) < x - 2 < \sqrt{4 + \varepsilon} - 2. \tag{3.11}$$

This suggests taking $\delta = \min(2 - \sqrt{4 - \varepsilon}, \sqrt{4 + \varepsilon} - 2)$ ($= \sqrt{4 + \varepsilon} - 2$, easy to verify). If $|x - 2| < \delta$, then (3.11) holds, which was equivalent to $|x^2 - 4| < \varepsilon$. With a few algebraic computations, this furnishes the greatest $\delta$ for which the inequality $|x^2 - 4| < \varepsilon$ is true.

We have already said that the largest value of $\delta$ is not required by the definitions, so we can also proceed alternatively. Since

$$|x^2 - 4| = |(x - 2)(x + 2)| = |x - 2|\,|x + 2|,$$

by restricting $x$ to a neighbourhood of 2 of radius $< 1$, we will have $-1 < x - 2 < 1$, hence $1 < x < 3$. The latter will then give $3 < x + 2 = |x + 2| < 5$. Thus

$$|x^2 - 4| \le 5\,|x - 2|. \tag{3.12}$$

To obtain $|x^2 - 4| < \varepsilon$ it will suffice to demand $|x - 2| < \frac{\varepsilon}{5}$; since (3.12) holds when $|x - 2| < 1$, we can set $\delta = \min\left(1, \frac{\varepsilon}{5}\right)$ and the condition (3.6) will be satisfied. The neighbourhood of radius $< 1$ was arbitrary: we could have chosen any other sufficiently small neighbourhood and obtained another $\delta$, still respecting the continuity requirement.

Note at last that a similar reasoning shows that $f$ is continuous at every $x_0 \in \mathbb{R}$.


iii) We verify that $f : \mathbb{R} \to \mathbb{R}$, $f(x) = \sin x$ is continuous at every $x_0 \in \mathbb{R}$. We establish first a simple but fundamental inequality.

Lemma 3.18 For any $x \in \mathbb{R}$,

$$|\sin x| \le |x|, \tag{3.13}$$

with equality holding if and only if $x = 0$.

Proof. Let us start assuming $0 < x \le \frac{\pi}{2}$ and look at the right-angled triangle $PHA$ of Fig. 3.7. The vertical side $PH$ is shorter than the hypotenuse $PA$, whose length is in turn less than the length of the arc $\overset{\frown}{PA}$ (the shortest distance between two points is given by the straight line joining them):

$$\overline{PH} < \overline{PA} < \overset{\frown}{PA}.$$

By definition $\overline{PH} = \sin x > 0$, and $\overset{\frown}{PA} = x > 0$ (angles being in radians). Thus (3.13) is true. The case $-\frac{\pi}{2} \le x < 0$ is treated with the same argument observing $|\sin x| = \sin|x|$ for $0 < |x| \le \frac{\pi}{2}$. At last, when $|x| > \frac{\pi}{2}$ one has $|\sin x| \le 1 < \frac{\pi}{2} < |x|$, ending the proof. □

Figure 3.7. $|\sin x| \le |x|$

Thanks to (3.13) we can prove that sine is a continuous function. Recalling formula (2.14),

$$\sin x - \sin x_0 = 2 \sin\frac{x - x_0}{2} \cos\frac{x + x_0}{2},$$

(3.13) and the fact that $|\cos t| \le 1$ for all $t \in \mathbb{R}$, imply

$$|\sin x - \sin x_0| = 2 \left|\sin\frac{x - x_0}{2}\right| \left|\cos\frac{x + x_0}{2}\right| \le 2 \left|\frac{x - x_0}{2}\right| \cdot 1 = |x - x_0|.$$

Therefore, given an $\varepsilon > 0$, if $|x - x_0| < \varepsilon$ we have $|\sin x - \sin x_0| < \varepsilon$; in other words, condition (3.6) is satisfied by $\delta = \varepsilon$.

Similarly, formula (2.15) allows us to prove that $g(x) = \cos x$ is continuous at every $x_0 \in \mathbb{R}$.
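
The two choices of $\delta$ found in Examples 3.17 ii) for $f(x) = x^2$ at $x_0 = 2$ can be compared numerically. The sketch below is a spot check on a finite grid, not a proof; the names `delta_sharp`, `delta_simple` and `works`, as well as the grid itself, are assumptions made for this illustration.

```python
import math


def delta_sharp(eps: float) -> float:
    """Largest admissible delta for x^2 at x0 = 2 (valid for 0 < eps <= 4)."""
    return math.sqrt(4 + eps) - 2


def delta_simple(eps: float) -> float:
    """Easier delta obtained from |x^2 - 4| <= 5|x - 2| on |x - 2| < 1."""
    return min(1.0, eps / 5)


def works(delta: float, eps: float, samples: int = 2001) -> bool:
    """Spot-check |x^2 - 4| < eps on a grid strictly inside (2 - delta, 2 + delta)."""
    # The factor 0.999 keeps every grid point strictly inside the open interval.
    xs = [2 + 0.999 * delta * (2 * k / (samples - 1) - 1) for k in range(samples)]
    return all(abs(x * x - 4) < eps for x in xs)


for eps in (4.0, 1.0, 0.1, 1e-3):
    print(eps, delta_simple(eps), delta_sharp(eps),
          works(delta_simple(eps), eps), works(delta_sharp(eps), eps))
```

Both values pass the check, and the simpler one is never larger than the sharp one, in agreement with the remark that any smaller $\delta$ remains admissible.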


Definition 3.19 Let $I$ be a subset of $\operatorname{dom} f$. The function $f$ is called continuous on $I$ (or over $I$) if $f$ is continuous at every point of $I$.


We remark that the use of the term ‘map’ (or ‘mapping’) is very different from 
author to author; in some books a map is simply a function (we have adopted 
this convention), for others the word ‘map’ automatically assumes continuity, so 
attention is required when browsing the literature. 




The following result is particularly relevant and will be used many times 
without explicit mention. For its proof, see Appendix A.2.2, p. 436.


Proposition 3.20 All elementary functions (polynomials, rational functions, powers, trigonometric functions, exponentials and their inverses) are continuous over their entire domains.


Let us point out that there exists a notion of continuity of a function on a 
subset of its domain, that is stronger than the one given in Definition 3.19; it is 
called uniform continuity. We refer to Appendix A.3.3, p. 447, for its definition 
and main properties. 


Now back to limits. A function $f$ defined in a neighbourhood of $x_0$, $x_0$ excluded, may assume bigger and bigger values as the independent variable $x$ gets closer to $x_0$. Consider for example the function

$$f(x) = \frac{1}{(x - 3)^2}$$

on $\mathbb{R} \setminus \{3\}$, and fix an arbitrarily large real number $A > 0$. Then $f(x) > A$ for all $x \ne 3$ such that $|x - 3| < \frac{1}{\sqrt{A}}$. We would like to say that $f$ tends to $+\infty$ for $x$ approaching $x_0 = 3$; the precise definition is as follows.


Definition 3.21 Let $f$ be defined in a neighbourhood of $x_0 \in \mathbb{R}$, except possibly at $x_0$. The function $f$ has limit $+\infty$ (or tends to $+\infty$) for $x$ approaching $x_0$, in symbols

$$\lim_{x\to x_0} f(x) = +\infty,$$

if for any $A > 0$ there is a $\delta > 0$ such that

$$\forall x \in \operatorname{dom} f, \qquad 0 < |x - x_0| < \delta \ \Rightarrow\ f(x) > A. \tag{3.14}$$

Otherwise said, for any neighbourhood $I_A(+\infty)$ of $+\infty$ there exists a neighbourhood $I_\delta(x_0)$ of $x_0$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I_\delta(x_0) \setminus \{x_0\} \ \Rightarrow\ f(x) \in I_A(+\infty).$$

The definition of

$$\lim_{x\to x_0} f(x) = -\infty$$

follows by changing $f(x) > A$ to $f(x) < -A$.


One also writes

$$\lim_{x\to x_0} f(x) = \infty$$

to indicate $\lim_{x\to x_0} |f(x)| = +\infty$. For instance the hyperbola $f(x) = \frac{1}{x}$, with graph in Fig. 2.2, does not admit limit for $x \to 0$, because on each neighbourhood $I_\delta(0)$ of the origin the function assumes both arbitrarily large positive and negative values together. On the other hand, $|f(x)|$ tends to $+\infty$ when $x$ nears 0. In fact, for fixed $A > 0$

$$\forall x \in \mathbb{R} \setminus \{0\}, \qquad |x| < \frac{1}{A} \ \Rightarrow\ \frac{1}{|x|} > A.$$

Hence $\lim_{x\to0} \frac{1}{x} = \infty$.


3.3.3 One-sided limits; points of discontinuity 


The previous example shows that a map may have different limit behaviours at the left and right of a point $x_0$. The function $f(x) = \frac{1}{x}$ grows indefinitely as $x$ takes positive values tending to 0; at the same time it decreases indefinitely as $x$ goes to 0 assuming negative values. Consider the graph of the mantissa $y = M(x)$ (see Fig. 2.3, p. 34) on a neighbourhood of $x_0 = 1$ of radius $< 1$. Then

$$M(x) = \begin{cases} x & \text{if } x < 1, \\ x - 1 & \text{if } x \ge 1. \end{cases}$$

When $x$ approaches 1, $M$ tends to 0 if $x$ takes values $> 1$ (i.e., at the right of 1), and tends to 1 if $x$ assumes values $< 1$ (at the left).

The notions of right-hand limit and left-hand limit (or simply right limit and left limit) arise from the need to understand these cases. For that, we define the right neighbourhood of $x_0$ of radius $r > 0$ as the bounded half-open interval

$$I_r^+(x_0) = [x_0, x_0 + r) = \{x \in \mathbb{R} : 0 \le x - x_0 < r\}.$$

The left neighbourhood of $x_0$ of radius $r > 0$ will be, similarly,

$$I_r^-(x_0) = (x_0 - r, x_0] = \{x \in \mathbb{R} : 0 \le x_0 - x < r\}.$$

Substituting the condition $0 < |x - x_0| < \delta$ (i.e., $x \in I_\delta(x_0) \setminus \{x_0\}$) with $0 < x - x_0 < \delta$ (i.e., $x \in I_\delta^+(x_0) \setminus \{x_0\}$) in Definitions 3.15 and 3.21 produces the corresponding definitions for right limit of $f$ for $x$ tending to $x_0$, otherwise said limit of $f$ for $x$ approaching $x_0$ from the right or limit on the right; such will be denoted by

$$\lim_{x\to x_0^+} f(x).$$


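For a finite limit, this reads as follows.

Definition 3.22 Let $f$ be defined on a right neighbourhood of $x_0 \in \mathbb{R}$, except possibly at $x_0$. The function $f$ has right limit $\ell \in \mathbb{R}$ for $x \to x_0$, if for every $\varepsilon > 0$ there is a $\delta > 0$ such that

$$\forall x \in \operatorname{dom} f, \qquad 0 < x - x_0 < \delta \ \Rightarrow\ |f(x) - \ell| < \varepsilon.$$

Alternatively, for any neighbourhood $I_\varepsilon(\ell)$ of $\ell$ there exists a right neighbourhood $I_\delta^+(x_0)$ of $x_0$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I_\delta^+(x_0) \setminus \{x_0\} \ \Rightarrow\ f(x) \in I_\varepsilon(\ell).$$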


The notion of continuity on the right is analogous. 


Definition 3.23 A function $f$ defined on a right neighbourhood of $x_0 \in \mathbb{R}$ is called continuous on the right at $x_0$ (or right-continuous) if

$$\lim_{x\to x_0^+} f(x) = f(x_0).$$

If a function is only defined on a right neighbourhood of $x_0$, right-continuity coincides with the earlier definition (3.6). The function $f(x) = \sqrt{x}$ for example is defined on $[0, +\infty)$, and is continuous at 0.

Limits of $f$ from the left and left-continuity are completely similar: now one has to use left neighbourhoods of $x_0$; the left limit shall be denoted by

$$\lim_{x\to x_0^-} f(x).$$


The following easy-to-prove property provides a handy criterion to study limits 
and continuity. 


Proposition 3.24 Let $f$ be defined in a neighbourhood of $x_0 \in \mathbb{R}$, with the possible exception of $x_0$. The function $f$ has limit $L$ (finite or infinite) for $x \to x_0$ if and only if the right and left limits of $f$, for $x \to x_0$, exist and equal $L$.


A function f defined in a neighbourhood of xo is continuous at xo if and only 
if it is continuous on the right and on the left at xo. 


Returning to the previous examples, it is not hard to see

$$\lim_{x\to0^+} \frac{1}{x} = +\infty; \qquad \lim_{x\to0^-} \frac{1}{x} = -\infty$$

and

$$\lim_{x\to1^+} M(x) = 0; \qquad \lim_{x\to1^-} M(x) = 1.$$

Note $M(1) = 0$, so $\lim_{x\to1^+} M(x) = M(1)$. All this means the function $M(x)$ is continuous on the right at $x_0 = 1$ (but not left-continuous, hence not continuous, at $x_0 = 1$).


Definition 3.25 Let $f$ be defined on a neighbourhood of $x_0 \in \mathbb{R}$, except possibly at $x_0$. If the left and right limits of $f$ for $x$ going to $x_0$ are different, we say that $x_0$ is a (point of) discontinuity of the first kind (or a jump point) for $f$. The gap value of $f$ at $x_0$ is the difference

$$\lim_{x\to x_0^+} f(x) - \lim_{x\to x_0^-} f(x).$$

Thus the mantissa has a gap $= -1$ at $x_0 = 1$ and, in general, at each point $x_0 = n \in \mathbb{Z}$.

Also the floor function $y = [x]$ jumps, at each $x_0 = n \in \mathbb{Z}$, with gap $= 1$, for

$$\lim_{x\to n^+} [x] = n; \qquad \lim_{x\to n^-} [x] = n - 1.$$

The sign function $y = \operatorname{sign}(x)$ has a jump point at $x_0 = 0$, with gap $= 2$:

$$\lim_{x\to0^+} \operatorname{sign}(x) = 1; \qquad \lim_{x\to0^-} \operatorname{sign}(x) = -1.$$
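
The gap values just computed can be estimated numerically by sampling a function slightly to the left and to the right of the jump point. This Python sketch is a crude heuristic under stated assumptions: the helper names `one_sided` and `gap` and the step `h = 1e-9` are hypothetical choices for this illustration, and the estimate is reliable only for functions, like the three above, that are well behaved on each side of the point.

```python
import math


def mantissa(x: float) -> float:
    """M(x) = x - [x], with [x] the integer part (floor) of x."""
    return x - math.floor(x)


def sign(x: float) -> int:
    """Sign function: 1 for x > 0, -1 for x < 0, 0 at x = 0."""
    return (x > 0) - (x < 0)


def one_sided(f, x0: float, h: float = 1e-9):
    """Crude estimates of the left and right limits of f at x0."""
    return f(x0 - h), f(x0 + h)


def gap(f, x0: float, h: float = 1e-9) -> float:
    """Estimated gap value: right limit minus left limit."""
    left, right = one_sided(f, x0, h)
    return right - left


print(gap(mantissa, 1.0))    # close to the gap of the mantissa at 1
print(gap(math.floor, 3.0))  # gap of the floor function at n = 3
print(gap(sign, 0.0))        # gap of the sign function at 0
```

The order of the difference matters: the gap is the right limit minus the left limit, which is why the mantissa yields a negative value while the floor and sign functions yield positive ones.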


Definition 3.26 A discontinuity point which is neither removable nor of the first kind is said to be of the second kind.

This occurs for instance when $f$ does not admit limit (neither on the left nor on the right) for $x \to x_0$. The function $f(x) = \sin\frac{1}{x}$ has no limit for $x \to 0$ (see Fig. 3.8 and the explanation in Remark 4.19).


3.3.4 Limits of monotone functions 


Monotonicity affects the possible limit behaviour of a map, as the following results 
explain. 


Figure 3.8. Graph of $f(x) = \sin\frac{1}{x}$


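Theorem 3.27 Let $f$ be a monotone function defined on a right neighbourhood $I^+(c)$ of the point $c$ (where $c$ is real or $-\infty$), possibly without the point $c$ itself. Then the right limit for $x \to c$ exists (finite or infinite), and precisely

$$\lim_{x\to c^+} f(x) = \begin{cases} \inf\{f(x) : x \in I^+(c),\ x > c\} & \text{if } f \text{ is increasing}, \\ \sup\{f(x) : x \in I^+(c),\ x > c\} & \text{if } f \text{ is decreasing}. \end{cases}$$

In the same way, $f$ monotone on a left neighbourhood $I^-(c) \setminus \{c\}$ of $c$ ($c$ real or $+\infty$) satisfies

$$\lim_{x\to c^-} f(x) = \begin{cases} \sup\{f(x) : x \in I^-(c),\ x < c\} & \text{if } f \text{ is increasing}, \\ \inf\{f(x) : x \in I^-(c),\ x < c\} & \text{if } f \text{ is decreasing}. \end{cases}$$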


Proof. We shall prove that if $f$ increases in the right neighbourhood $I^+(c)$ of $c$ then

$$\lim_{x\to c^+} f(x) = \inf\{f(x) : x \in I^+(c),\ x > c\}.$$

The other cases are similar.

Let $\ell = \inf\{f(x) : x \in I^+(c),\ x > c\} \in \mathbb{R}$. The infimum is characterised, in analogy with (1.7), by:

i) for all $x \in I^+(c) \setminus \{c\}$, $f(x) \ge \ell$;

ii) for any $\varepsilon > 0$, there exists an element $x_\varepsilon \in I^+(c) \setminus \{c\}$ such that $f(x_\varepsilon) < \ell + \varepsilon$.

By monotonicity we have

$$f(x) \le f(x_\varepsilon), \qquad \forall x \in I^+(c) \setminus \{c\},\ x < x_\varepsilon,$$

therefore

$$\ell - \varepsilon < \ell \le f(x) < \ell + \varepsilon, \qquad \forall x \in I^+(c) \setminus \{c\},\ x < x_\varepsilon.$$

So, each $f(x)$ belongs to the neighbourhood of $\ell$ of radius $\varepsilon$ if $x \ne c$ is in the right neighbourhood of $c$ with supremum $x_\varepsilon$. Thus we have

$$\lim_{x\to c^+} f(x) = \ell.$$

Let now $\ell = -\infty$; this means that for any $A > 0$ there is an $x_A \in I^+(c) \setminus \{c\}$ such that $f(x_A) < -A$. Using monotonicity again we obtain $f(x) \le f(x_A) < -A$, $\forall x \in I^+(c) \setminus \{c\}$ and $x < x_A$. Hence $f(x)$ belongs to the neighbourhood of $-\infty$ with supremum $-A$ provided $x \ne c$ is in the right neighbourhood of $c$ of supremum $x_A$. We conclude

$$\lim_{x\to c^+} f(x) = -\infty.$$


A straightforward consequence is that a monotone function can have only discontinuities of the first kind.


Corollary 3.28 Let $f$ be monotone on a neighbourhood $I(x_0)$ of $x_0 \in \mathbb{R}$. Then the right and left limits for $x \to x_0$ exist and are finite. More precisely,

i) if $f$ is increasing

$$\lim_{x\to x_0^-} f(x) \le f(x_0) \le \lim_{x\to x_0^+} f(x);$$

ii) if $f$ is decreasing

$$\lim_{x\to x_0^-} f(x) \ge f(x_0) \ge \lim_{x\to x_0^+} f(x).$$

Proof. Let $f$ be increasing. Then for all $x \in I(x_0)$ with $x < x_0$, $f(x) \le f(x_0)$. The above theorem guarantees that

$$\lim_{x\to x_0^-} f(x) = \sup\{f(x) : x \in I(x_0),\ x < x_0\} \le f(x_0).$$

Similarly, for $x \in I(x_0)$ with $x > x_0$,

$$f(x_0) \le \inf\{f(x) : x \in I(x_0),\ x > x_0\} = \lim_{x\to x_0^+} f(x),$$

from which i) follows. The second implication is alike.




3.4 Exercises 


1. Using the definition prove that 


a) $\lim_{n\to+\infty} n! = +\infty$ &nbsp;&nbsp; b) $\lim_{n\to+\infty} \dfrac{n^2}{1 - 2n} = -\infty$


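c) $\lim_{x\to1} (2x^2 + 3) = 5$ &nbsp;&nbsp; d) $\lim_{x\to2^\pm} \dfrac{1}{x^2 - 4} = \pm\infty$

e) $\lim_{x\to-\infty} \dfrac{x}{\sqrt{x^2 - 1}} = -1$ &nbsp;&nbsp; f) $\lim_{x\to+\infty} \dfrac{x^2}{1 - x} = -\infty$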


2. Let $f(x) = \operatorname{sign}(x^2 - x)$. Discuss the existence of the limits

$$\lim_{x\to0} f(x) \quad\text{and}\quad \lim_{x\to1} f(x)$$

and study the function's continuity.


3. Determine the values of the real parameter a for which the following maps are 
continuous on their respective domains: 


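a) $f(x) = \begin{cases} \alpha \sin\left(x + \frac{\pi}{2}\right) & \text{if } x > 0, \\ 2x^2 + 3 & \text{if } x \le 0 \end{cases}$ &nbsp;&nbsp; b) $f(x) = \begin{cases} 3e^{\alpha x - 1} & \text{if } x > 1, \\ x + 2 & \text{if } x \le 1 \end{cases}$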


3.4.1 Solutions 


1. Limits: 


a) Let a real number $A > 0$ be given; it is sufficient to choose any natural number $n_A \ge A$ and notice that if $n > n_A$ then

$$n! = n(n - 1) \cdots 2 \cdot 1 \ge n > n_A \ge A.$$

Thus $\lim_{n\to+\infty} n! = +\infty$.

b) Fix a real $A > 0$ and note that $\frac{n^2}{1 - 2n} < -A$ is the same as $\frac{n^2}{2n - 1} > A$. For $n \ge 1$, this means $n^2 - 2An + A > 0$. If we consider a natural number $n_A \ge A + \sqrt{A(A + 1)}$, the inequality holds for all $n > n_A$.

c) Fix $\varepsilon > 0$ and study the condition $|f(x) - \ell| < \varepsilon$:

$$|2x^2 + 3 - 5| = 2|x^2 - 1| = 2|x - 1|\,|x + 1| < \varepsilon.$$

Without loss of generality we assume $x$ belongs to the neighbourhood of 1 of radius 1, i.e.,

$$-1 < x - 1 < 1, \quad\text{whence}\quad 0 < x < 2 \quad\text{and}\quad 1 < x + 1 = |x + 1| < 3.$$

Therefore

$$|2x^2 + 3 - 5| \le 2 \cdot 3\,|x - 1| = 6|x - 1|.$$

The expression on the right is $< \varepsilon$ if $|x - 1| < \frac{\varepsilon}{6}$. It will be enough to set $\delta = \min\left(1, \frac{\varepsilon}{6}\right)$ to prove the claim.
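2. Since $x^2 - x > 0$ when $x < 0$ or $x > 1$, the function $f(x)$ is thus defined:

$$f(x) = \begin{cases} 1 & \text{if } x < 0 \text{ or } x > 1, \\ 0 & \text{if } x = 0 \text{ or } x = 1, \\ -1 & \text{if } 0 < x < 1. \end{cases}$$

So $f$ is constant on the intervals $(-\infty, 0)$, $(0, 1)$, $(1, +\infty)$ and

$$\lim_{x\to0^-} f(x) = 1, \quad \lim_{x\to0^+} f(x) = -1, \quad \lim_{x\to1^-} f(x) = -1, \quad \lim_{x\to1^+} f(x) = 1.$$

The required limits do not exist. The function is continuous on all $\mathbb{R}$ with the exception of the jump points $x = 0$ and $x = 1$.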
3. Continuity: 
a) The domain of $f$ is $\mathbb{R}$ and the function is continuous for $x \ne 0$, irrespective of $\alpha$. As for the continuity at $x = 0$, observe that

$$\lim_{x\to0^-} f(x) = \lim_{x\to0^-} (2x^2 + 3) = 3 = f(0),$$

$$\lim_{x\to0^+} f(x) = \lim_{x\to0^+} \alpha \sin\left(x + \frac{\pi}{2}\right) = \alpha.$$

These imply $f$ is continuous also in $x = 0$ if $\alpha = 3$.

b) $\alpha = 1$.


4


Limits and continuity II 


The study of limits continues with the discussion of tools that facilitate compu- 
tations and avoid having to resort to the definition each time. We introduce the 
notion of indeterminate form, and infer some remarkable limits. The last part of 
the chapter is devoted to continuous functions on real intervals. 


4.1 Theorems on limits 


A bit of notation to begin with: the symbol $c$ will denote any of $x_0$, $x_0^+$, $x_0^-$, $+\infty$, $-\infty$, $\infty$ introduced previously. Correspondingly, $I(c)$ will be a neighbourhood $I_\delta(x_0)$ of $x_0 \in \mathbb{R}$ of radius $\delta$, a right neighbourhood $I_\delta^+(x_0)$, a left neighbourhood $I_\delta^-(x_0)$, a neighbourhood $I_B(+\infty)$ of $+\infty$ with end-point $B > 0$, a neighbourhood $I_B(-\infty)$ of $-\infty$ with end-point $-B$, or a neighbourhood $I_B(\infty) = I_B(-\infty) \cup I_B(+\infty)$ of $\infty$.

We shall suppose from now on $f, g, h, \ldots$ are functions defined on a neighbourhood of $c$ with the point $c$ deleted, unless otherwise stated. In accordance with the meaning of $c$, the expression $\lim_{x\to c} f(x)$ will stand for the limit of $f$ for $x \to x_0 \in \mathbb{R}$, the right or left limit, the limit for $x$ tending to $+\infty$, $-\infty$, or for $|x| \to +\infty$.


4.1.1 Uniqueness and sign of the limit 


We start with the uniqueness of a limit, which justifies having so far said ‘the limit 
of f’, in place of ‘a limit of f’. 


Theorem 4.1 (Uniqueness of the limit) Suppose $f$ admits (finite or infinite) limit $\ell$ for $x \to c$. Then $f$ admits no other limit for $x \to c$.


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_4, 
© Springer International Publishing Switzerland 2015 


Figure 4.1. The neighbourhoods of $\ell$, $\ell'$ of radius $\varepsilon \le \frac12|\ell - \ell'|$ are disjoint


Proof. We assume there exist two limits $\ell' \ne \ell$ and infer a contradiction. We consider only the case where $\ell$ and $\ell'$ are both finite, for the other situations can be easily deduced adapting the same argument. First of all, since $\ell' \ne \ell$ there exist disjoint neighbourhoods $I(\ell)$ of $\ell$ and $I(\ell')$ of $\ell'$:

$$I(\ell) \cap I(\ell') = \emptyset. \tag{4.1}$$

To see this fact, it is enough to consider neighbourhoods of radius $\varepsilon$ smaller than or equal to half the distance between $\ell$ and $\ell'$, $\varepsilon \le \frac12|\ell - \ell'|$ (Fig. 4.1).

Taking $I(\ell)$, the hypothesis $\lim_{x\to c} f(x) = \ell$ implies the existence of a neighbourhood $I(c)$ of $c$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I(c) \setminus \{c\} \ \Rightarrow\ f(x) \in I(\ell).$$

Similarly for $I(\ell')$, from $\lim_{x\to c} f(x) = \ell'$ it follows there is $I'(c)$ with

$$\forall x \in \operatorname{dom} f, \qquad x \in I'(c) \setminus \{c\} \ \Rightarrow\ f(x) \in I(\ell').$$

The intersection of $I(c)$ and $I'(c)$ is itself a neighbourhood of $c$: it contains infinitely many points of the domain of $f$ since we assumed $f$ was defined in a neighbourhood of $c$ (possibly minus $c$). Therefore if $\bar x \in \operatorname{dom} f$ is any point in the intersection, different from $c$,

$$f(\bar x) \in I(\ell) \cap I(\ell'),$$

hence the intervals $I(\ell)$ and $I(\ell')$ do have non-empty intersection, contradicting (4.1). □


The second property we present concerns the sign of a limit around a point c. 


Theorem 4.2 Suppose $f$ admits limit $\ell$ (finite or infinite) for $x \to c$. If $\ell > 0$ or $\ell = +\infty$, there exists a neighbourhood $I(c)$ of $c$ such that $f$ is strictly positive on $I(c) \setminus \{c\}$. A similar assertion holds when $\ell < 0$ or $\ell = -\infty$.

Proof. Assume $\ell$ is finite, positive, and consider the neighbourhood $I_\varepsilon(\ell)$ of $\ell$ of radius $\varepsilon = \ell/2 > 0$. According to the definition, there is a neighbourhood $I(c)$ of $c$ satisfying

$$\forall x \in \operatorname{dom} f, \qquad x \in I(c) \setminus \{c\} \ \Rightarrow\ f(x) \in I_\varepsilon(\ell).$$

As $I_\varepsilon(\ell) = \left(\frac{\ell}{2}, \frac{3\ell}{2}\right) \subset (0, +\infty)$, all values $f(x)$ are positive.

If $\ell = +\infty$ it suffices to take a neighbourhood $I_A(+\infty) = (A, +\infty)$ of $+\infty$ ($A > 0$) and use the corresponding definition of limit. □

Figure 4.2. Around a limit value, the sign of a map does not change


The next result explains in which sense the implication in Theorem 4.2 can be 
‘almost’ reversed. 


Corollary 4.3 Assume $f$ admits limit $\ell$ (finite or infinite) for $x$ tending to $c$. If there is a neighbourhood $I(c)$ of $c$ such that $f(x) \ge 0$ in $I(c) \setminus \{c\}$, then $\ell \ge 0$ or $\ell = +\infty$. A similar assertion holds for a 'negative' limit.

Proof. By contradiction, if $\ell = -\infty$ or $\ell < 0$, Theorem 4.2 would provide a neighbourhood $I'(c)$ of $c$ such that $f(x) < 0$ on $I'(c) \setminus \{c\}$. On the intersection of $I(c)$ and $I'(c)$ we would then simultaneously have $f(x) < 0$ and $f(x) \ge 0$, which is not possible. □

Note that even assuming the stronger inequality $f(x) > 0$ on $I(c)$, we would not be able to exclude that $\ell$ might be zero. For example, the map

$$f(x) = \begin{cases} x^2 & \text{if } x \ne 0, \\ 1 & \text{if } x = 0, \end{cases}$$

is strictly positive in every neighbourhood of the origin, yet $\lim_{x\to0} f(x) = 0$.
xz 


4.1.2 Comparison theorems 


A few results are known that allow to compare the behaviour of functions, the first 
of which generalises the above corollary. 




Corollary 4.4 (First comparison theorem) Let a function $f$ have limit $\ell$ and a function $g$ limit $m$ ($\ell$, $m$ finite or not) for $x \to c$. If there is a neighbourhood $I(c)$ of $c$ such that $f(x) \le g(x)$ in $I(c) \setminus \{c\}$, then $\ell \le m$.

Proof. If $\ell = -\infty$ or $m = +\infty$ there is nothing to prove. Otherwise, consider the map $h(x) = g(x) - f(x)$. By assumption $h(x) \ge 0$ on $I(c) \setminus \{c\}$. Besides, Theorem 4.10 on the algebra of limits guarantees

$$\lim_{x\to c} h(x) = \lim_{x\to c} g(x) - \lim_{x\to c} f(x) = m - \ell.$$

The previous corollary applied to $h$ forces $m - \ell \ge 0$, hence the claim.


We establish now two useful criteria on the existence of limits based on com- 
paring a given function with others whose limit is known. 


Theorem 4.5 (Second comparison theorem, finite case, also known as "Squeeze rule") Let functions $f$, $g$ and $h$ be given, and assume $f$ and $h$ have the same finite limit for $x \to c$, precisely

$$\lim_{x\to c} f(x) = \lim_{x\to c} h(x) = \ell.$$

If there is a neighbourhood $I(c)$ of $c$ where the three functions are defined (except possibly at $c$) and such that

$$f(x) \le g(x) \le h(x), \qquad \forall x \in I(c) \setminus \{c\}, \tag{4.2}$$

then

$$\lim_{x\to c} g(x) = \ell.$$

Proof. We follow the definition of limit for $g$. Fix a neighbourhood $I_\varepsilon(\ell)$ of $\ell$; by the hypothesis $\lim_{x\to c} f(x) = \ell$ we deduce the existence of a neighbourhood $I'(c)$ of $c$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I'(c) \setminus \{c\} \ \Rightarrow\ f(x) \in I_\varepsilon(\ell).$$

The condition $f(x) \in I_\varepsilon(\ell)$ can be written as $|f(x) - \ell| < \varepsilon$, or

$$\ell - \varepsilon < f(x) < \ell + \varepsilon, \tag{4.3}$$

recalling (1.4). Similarly, $\lim_{x\to c} h(x) = \ell$ implies there is a neighbourhood $I''(c)$ of $c$ such that

$$\forall x \in \operatorname{dom} h, \qquad x \in I''(c) \setminus \{c\} \ \Rightarrow\ \ell - \varepsilon < h(x) < \ell + \varepsilon. \tag{4.4}$$

Define then $I'''(c) = I(c) \cap I'(c) \cap I''(c)$. On $I'''(c) \setminus \{c\}$ the constraints (4.2), (4.3) and (4.4) all hold, hence in particular

$$x \in I'''(c) \setminus \{c\} \ \Rightarrow\ \ell - \varepsilon < f(x) \le g(x) \le h(x) < \ell + \varepsilon.$$

This means $g(x) \in I_\varepsilon(\ell)$, concluding the proof.

Figure 4.3. The squeeze rule


Examples 4.6 


i) Let us prove the fundamental limit

$$\lim_{x\to0} \frac{\sin x}{x} = 1. \tag{4.5}$$

Observe first that $y = \frac{\sin x}{x}$ is even, for $\frac{\sin(-x)}{-x} = \frac{-\sin x}{-x} = \frac{\sin x}{x}$. It is thus sufficient to consider a positive $x$ tending to 0, i.e., prove that $\lim_{x\to0^+} \frac{\sin x}{x} = 1$.

Recalling (3.13), for all $x > 0$ we have $\sin x < x$, or $\frac{\sin x}{x} < 1$. To find a lower bound, suppose $x < \frac{\pi}{2}$ and consider points on the unit circle: let $A$ have coordinates $(1, 0)$, $P$ coordinates $(\cos x, \sin x)$ and let $Q$ be defined by $(1, \tan x)$ (Fig. 4.4). The circular sector $OAP$ is a proper subset of the triangle $OAQ$, so

$$\text{Area } OAP < \text{Area } OAQ.$$

Since

$$\text{Area } OAP = \frac{\overline{OA} \cdot \overset{\frown}{AP}}{2} = \frac{x}{2} \quad\text{and}\quad \text{Area } OAQ = \frac{\overline{OA} \cdot \overline{AQ}}{2} = \frac{\tan x}{2},$$

it follows

$$\frac{x}{2} < \frac{\sin x}{2\cos x}, \quad\text{i.e.,}\quad \cos x < \frac{\sin x}{x}.$$

Figure 4.4. The sector $OAP$ is properly contained in $OAQ$

Eventually, on $0 < x < \frac{\pi}{2}$ one has

$$\cos x < \frac{\sin x}{x} < 1.$$

The continuity of the cosine ensures $\lim_{x\to0^+} \cos x = 1$. Now the claim follows from the Second comparison theorem.

ii) We would like to study how the function $g(x) = \frac{\sin x}{x}$ behaves for $x$ tending to $+\infty$. Remember that

$$-1 \le \sin x \le 1 \tag{4.6}$$

for any real $x$. Dividing by $x > 0$ will not alter the inequalities, so in every neighbourhood $I_A(+\infty)$ of $+\infty$

$$-\frac{1}{x} \le \frac{\sin x}{x} \le \frac{1}{x}.$$

Now set $f(x) = -\frac{1}{x}$, $h(x) = \frac{1}{x}$ and note $\lim_{x\to+\infty} \frac{1}{x} = 0$. By the previous theorem

$$\lim_{x\to+\infty} \frac{\sin x}{x} = 0.$$
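
The two squeezes used in Examples 4.6 can be observed numerically. The Python sketch below is an illustration under stated assumptions, not a proof: the function names and the sample points are hypothetical choices, and the points near 0 are kept away from the origin because in floating-point arithmetic $\frac{\sin x}{x}$ rounds to exactly 1 for very small $x$.

```python
import math


def squeezed_near_zero(x: float) -> bool:
    """Check cos x < sin x / x < 1 for a given x in (0, pi/2)."""
    return math.cos(x) < math.sin(x) / x < 1.0


def squeezed_at_infinity(x: float) -> bool:
    """Check -1/x <= sin x / x <= 1/x for a given x > 0."""
    return -1.0 / x <= math.sin(x) / x <= 1.0 / x


# Sample points in (0, pi/2), bounded away from 0 to avoid rounding to 1.
near_zero = [0.01 * k for k in range(1, 151)]
# Sample points growing towards +infinity.
far_out = [10.0 ** k for k in range(0, 9)]

print(all(squeezed_near_zero(x) for x in near_zero))
print(all(squeezed_at_infinity(x) for x in far_out))
print([abs(math.sin(x) / x) for x in far_out][-3:])  # values shrinking towards 0
```

In both cases the outer functions have a common limit (1 as $x \to 0^+$, and 0 as $x \to +\infty$), which is what lets Theorem 4.5 transfer that limit to the function trapped between them.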


The latter example is part of a more general result which we state next (and 
both are consequences of Theorem 4.5). 


Corollary 4.7 Let $f$ be a bounded function around $c$, i.e., there exist a neighbourhood $I(c)$ and a constant $C > 0$ such that

$$|f(x)| \le C, \qquad \forall x \in I(c) \setminus \{c\}. \tag{4.7}$$

Let $g$ be such that

$$\lim_{x\to c} g(x) = 0.$$

Then it follows

$$\lim_{x\to c} f(x)\,g(x) = 0.$$

Proof. By definition $\lim_{x\to c} g(x) = 0$ if and only if $\lim_{x\to c} |g(x)| = 0$, and (4.7) implies

$$0 \le |f(x)\,g(x)| \le C\,|g(x)|, \qquad \forall x \in I(c) \setminus \{c\}.$$

The claim follows by applying Theorem 4.5.


Theorem 4.8 (Second comparison theorem, infinite case) Let $f$, $g$ be given functions and

$$\lim_{x\to c} f(x) = +\infty.$$

If there exists a neighbourhood $I(c)$ of $c$, where both functions are defined (except possibly at $c$), such that

$$f(x) \le g(x), \qquad \forall x \in I(c) \setminus \{c\}, \tag{4.8}$$

then

$$\lim_{x\to c} g(x) = +\infty.$$

A result of the same kind for $f$ holds when the limit of $g$ is $-\infty$.

Proof. The proof is, with the necessary changes, like that of Theorem 4.5, hence left to the reader.


Example 4.9 
Compute the limit of $g(x) = x + \sin x$ when $x \to +\infty$. Using (4.6) we have

$$x - 1 \le x + \sin x, \qquad \forall x \in \mathbb{R}.$$

Set $f(x) = x - 1$; since $\lim_{x\to+\infty} f(x) = +\infty$, the theorem tells us

$$\lim_{x\to+\infty} (x + \sin x) = +\infty.$$


4.1.3 Algebra of limits. Indeterminate forms of algebraic type 


This section is devoted to the interaction of limits with the algebraic operations 
of sum, difference, product and quotient of functions. 

First though, we must extend arithmetic operations to treat the symbols +00 
and —oo. Let us set: 


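$+\infty + s = +\infty$   (if $s \in \mathbb{R}$ or $s = +\infty$)
$-\infty + s = -\infty$   (if $s \in \mathbb{R}$ or $s = -\infty$)
$\pm\infty \cdot s = \pm\infty$   (if $s > 0$ or $s = +\infty$)
$\pm\infty \cdot s = \mp\infty$   (if $s < 0$ or $s = -\infty$)
$\dfrac{\pm\infty}{s} = \pm\infty$   (if $s > 0$)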


A result of the foremost importance comes next. 


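Theorem 4.10 Suppose f admits limit $\ell$ (finite or infinite) and g admits
limit $m$ (finite or infinite) for $x \to c$. Then

$$\lim_{x\to c} [f(x) \pm g(x)] = \ell \pm m,$$
$$\lim_{x\to c} [f(x)\, g(x)] = \ell\, m,$$
$$\lim_{x\to c} \frac{f(x)}{g(x)} = \frac{\ell}{m},$$

provided the right-hand-side expressions make sense. (In the last case one
assumes $g(x) \ne 0$ on some $I(c) \setminus \{c\}$.)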


Proof. We shall prove two relations only, referring the reader to Appendix A.2.1,
p. 433, for the ones left behind. The first we concentrate upon is

$$\lim_{x\to c} (f(x) + g(x)) = \ell + m$$

when $\ell$ and $m$ are finite. Fix $\varepsilon > 0$, and consider the neighbourhood of $\ell$
of radius $\varepsilon/2$. By assumption there is a neighbourhood $I'(c)$ of $c$ such that

$$\forall x \in \operatorname{dom} f, \quad x \in I'(c) \setminus \{c\} \;\Rightarrow\; |f(x) - \ell| < \varepsilon/2.$$

For the same reason there is also an $I''(c)$ with

$$\forall x \in \operatorname{dom} g, \quad x \in I''(c) \setminus \{c\} \;\Rightarrow\; |g(x) - m| < \varepsilon/2.$$

Put $I(c) = I'(c) \cap I''(c)$. Then if $x \in \operatorname{dom} f \cap \operatorname{dom} g$ belongs to $I(c) \setminus \{c\}$,
both inequalities hold; the triangle inequality (1.1) yields

$$|(f(x) + g(x)) - (\ell + m)| = |(f(x) - \ell) + (g(x) - m)| \le |f(x) - \ell| + |g(x) - m| < \frac{\varepsilon}{2} + \frac{\varepsilon}{2} = \varepsilon,$$

proving the assertion.


The second relation is 


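$$\lim_{x\to c} (f(x)\, g(x)) = +\infty$$

with $\ell = +\infty$ and $m > 0$ finite. For a given real $A > 0$, consider the
neighbourhood of $+\infty$ with end-point $B = 2A/m > 0$. We know there is
a neighbourhood $I'(c)$ such that

$$\forall x \in \operatorname{dom} f, \quad x \in I'(c) \setminus \{c\} \;\Rightarrow\; f(x) > B.$$

On the other hand, considering the neighbourhood of $m$ of radius $m/2$,
there exists an $I''(c)$ such that

$$\forall x \in \operatorname{dom} g, \quad x \in I''(c) \setminus \{c\} \;\Rightarrow\; |g(x) - m| < m/2,$$

i.e., $m/2 < g(x) < 3m/2$. Set $I(c) = I'(c) \cap I''(c)$. If $x \in \operatorname{dom} f \cap \operatorname{dom} g$ is
in $I(c) \setminus \{c\}$, the previous relations will be both fulfilled, whence

$$f(x)\, g(x) > f(x)\, \frac{m}{2} > B\, \frac{m}{2} = A.$$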


Corollary 4.11 If f and g are continuous maps at a point $x_0 \in \mathbb{R}$, then also
$f(x) \pm g(x)$, $f(x)\, g(x)$ and $\frac{f(x)}{g(x)}$ (provided $g(x_0) \ne 0$) are continuous at $x_0$.


Proof. The condition that f and g are continuous at $x_0$ is equivalent to $\lim_{x\to x_0} f(x) = f(x_0)$
and $\lim_{x\to x_0} g(x) = g(x_0)$ (recall (3.9)). The previous theorem allows us
to conclude.


Corollary 4.12 Rational functions are continuous on their domain. In particular,
polynomials are continuous on $\mathbb{R}$.


Proof. We verified in Example 3.17, part i), that the constants $y = a$ and the
linear function $y = x$ are continuous on $\mathbb{R}$. Consequently, maps like $y = a x^n$
($n \in \mathbb{N}$) are continuous. But then so are polynomials, being sums of the
latter. Rational functions, as quotients of polynomials, inherit the property
wherever the denominator does not vanish.


Examples 4.13 


i) Calculate
$$\lim_{x\to 0} \frac{2x - 3\cos x}{5 + x \sin x} = \ell.$$
The continuity of numerator and denominator descends from algebraic operations
on continuous maps, and the denominator is not zero at $x = 0$. The
substitution of 0 to $x$ produces $\ell = -3/5$.


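ii) Discuss the limit behaviour of $y = \tan x$ when $x \to \frac{\pi}{2}$. Since

$$\lim_{x\to\frac{\pi}{2}} \sin x = \sin\frac{\pi}{2} = 1 \quad\text{and}\quad \lim_{x\to\frac{\pi}{2}} \cos x = \cos\frac{\pi}{2} = 0,$$

the above theorem tells

$$\lim_{x\to\frac{\pi}{2}} \tan x = \lim_{x\to\frac{\pi}{2}} \frac{\sin x}{\cos x} = \frac{1}{0} = \infty.$$

But one can be more precise by looking at the sign of the tangent around $\frac{\pi}{2}$. Since
$\sin x > 0$ in a neighbourhood of $\frac{\pi}{2}$, while $\cos x > 0$ ($< 0$) in a left (resp. right)
neighbourhood of $\frac{\pi}{2}$, it follows

$$\lim_{x\to\frac{\pi}{2}^{\pm}} \tan x = \mp\infty.$$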


iii) Let $R(x) = \frac{P(x)}{Q(x)}$ be rational and reduced, meaning the polynomials P, Q
have no common factor. Call $x_0 \in \mathbb{R}$ a zero of Q, i.e., a point such that $Q(x_0) = 0$.
Clearly $P(x_0) \ne 0$, otherwise P and Q would be both divisible by $(x - x_0)$. Then

$$\lim_{x\to x_0} R(x) = \infty$$

follows. In this case too, the sign of $R(x)$ around $x_0$ retains some information.
For instance, $y = \frac{x^2 - 3x + 1}{x^2 - x}$ is positive on a left neighbourhood of $x_0 = 1$ and
negative on a right neighbourhood, so

$$\lim_{x\to 1^{\pm}} \frac{x^2 - 3x + 1}{x^2 - x} = \mp\infty.$$

In contrast, the function $y = \frac{x - 2}{x^2 - 2x + 1}$ is negative in a whole neighbourhood
of $x_0 = 1$, hence

$$\lim_{x\to 1} \frac{x - 2}{x^2 - 2x + 1} = -\infty.$$


Theorem 4.10 gives no indication about the limit behaviour of an algebraic
expression in three cases, listed below. The expressions in question are called
indeterminate forms of algebraic type.

i) Consider $f(x) + g(x)$ (resp. $f(x) - g(x)$) when both f, g tend to $\infty$ with different
(resp. same) signs. This gives rise to the indeterminate form denoted by the
symbol
$$\infty - \infty.$$

ii) The product $f(x)\, g(x)$, when one function tends to $\infty$ and the other to 0, is
the indeterminate form with symbol
$$\infty \cdot 0.$$

iii) Relatively to $\frac{f(x)}{g(x)}$, in case both functions tend to $\infty$ or 0, the indeterminate
forms are denoted with
$$\frac{\infty}{\infty} \quad\text{or}\quad \frac{0}{0}.$$

In the presence of an indeterminate form, the limit behaviour cannot be told a
priori, and there are examples for each possible outcome: infinite, finite non-zero, zero,
even non-existing limit. Each indeterminate form must be treated individually and
often requires considerable care.

Later we shall find the actual limit behaviour of many important indeterminate
forms. With those and this section's theorems we will discuss more complicated
indeterminate forms. Additional tools to analyse this behaviour will be provided
further on: they are the local comparison of functions by means of the Landau symbols
(Sect. 5.1), de l'Hôpital's Theorem (Sect. 6.11), the Taylor expansion (Sect. 7.1).
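A loose computational analogy may help fix the idea. IEEE 754 floating-point arithmetic, as exposed by Python's `float`, adopts conventions close to the extended operations on $\pm\infty$ and returns the special value `nan` ("not a number") precisely for the algebraic indeterminate forms. The sketch below is only an illustration under that assumption: floating-point infinities are not limits, and the names `inf` and `indeterminate` are chosen here for convenience.

```python
import math

inf = math.inf

# Determinate cases: the result is fixed by the extended arithmetic.
print(inf + 5.0)    # inf
print(-inf + 5.0)   # -inf
print(inf * 2.0)    # inf
print(inf * -2.0)   # -inf
print(3.0 / inf)    # 0.0

# Indeterminate forms: no value can be assigned, so the result is nan.
indeterminate = {
    "inf - inf": inf - inf,
    "inf * 0": inf * 0.0,
    "inf / inf": inf / inf,
}
for name, value in indeterminate.items():
    print(f"{name} -> {value} (nan: {math.isnan(value)})")

# Note: 0.0 / 0.0 is not shown because Python raises ZeroDivisionError
# for float division by zero instead of returning nan.
```

Just as `nan` carries no information about the operands, the symbol of an indeterminate form says nothing about the limit; the functions involved must be examined case by case.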


Examples 4.14 
i) Let $x$ tend to $+\infty$ and define functions $f_1(x) = x + x^2$, $f_2(x) = x + 1$, $f_3(x) = x + \frac{1}{x}$,
$f_4(x) = x + \sin x$. Set $g(x) = x$. Using Theorem 4.10, or Example 4.9, one
verifies easily that all maps tend to $+\infty$. One has

$$\lim_{x\to+\infty} [f_1(x) - g(x)] = \lim_{x\to+\infty} x^2 = +\infty,$$
$$\lim_{x\to+\infty} [f_2(x) - g(x)] = \lim_{x\to+\infty} 1 = 1,$$
$$\lim_{x\to+\infty} [f_3(x) - g(x)] = \lim_{x\to+\infty} \frac{1}{x} = 0,$$

whereas the limit of $f_4(x) - g(x) = \sin x$ does not exist: the function $\sin x$ is
periodic and assumes each value between $-1$ and 1 infinitely many times as
$x \to +\infty$.

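ii) Consider now $x \to 0$. Let $f_1(x) = x^3$, $f_2(x) = x^2$, $f_3(x) = x$, $f_4(x) = x^2 \sin\frac{1}{x}$
and $g(x) = x^2$. All functions converge to 0 (for $f_4$ apply Corollary 4.7). Now

$$\lim_{x\to 0} \frac{f_1(x)}{g(x)} = \lim_{x\to 0} x = 0,$$
$$\lim_{x\to 0} \frac{f_2(x)}{g(x)} = \lim_{x\to 0} 1 = 1,$$
$$\lim_{x\to 0} \frac{f_3(x)}{g(x)} = \lim_{x\to 0} \frac{1}{x} = \infty,$$

but $\frac{f_4(x)}{g(x)} = \sin\frac{1}{x}$ does not admit limit for $x \to 0$ (Remark 4.19 furnishes a
proof of this).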


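iii) Let us consider a polynomial
$$P(x) = a_n x^n + \ldots + a_1 x + a_0 \qquad (a_n \ne 0)$$
for $x \to \pm\infty$. A function of this sort can give rise to an indeterminate form
$\infty - \infty$ according to the coefficients' signs and the degree of the monomials
involved. The problem is sorted by factoring out the leading term (monomial of
maximal degree) $x^n$:
$$P(x) = x^n \left( a_n + \frac{a_{n-1}}{x} + \ldots + \frac{a_1}{x^{n-1}} + \frac{a_0}{x^n} \right).$$

The part in brackets converges to $a_n$ when $x \to \pm\infty$, so

$$\lim_{x\to\pm\infty} P(x) = \lim_{x\to\pm\infty} a_n x^n = \infty.$$

The sign of the limit is easily found. For instance,
$$\lim_{x\to-\infty} (-5x^3 + 2x^2 + 7) = \lim_{x\to-\infty} (-5x^3) = +\infty.$$
Take now a reduced rational function
$$R(x) = \frac{P(x)}{Q(x)} = \frac{a_n x^n + \ldots + a_1 x + a_0}{b_m x^m + \ldots + b_1 x + b_0} \qquad (a_n, b_m \ne 0,\ m > 0).$$

When $x \to \pm\infty$, an indeterminate form $\frac{\infty}{\infty}$ arises. With the same technique as
before,

$$\lim_{x\to\pm\infty} \frac{P(x)}{Q(x)} = \lim_{x\to\pm\infty} \frac{a_n x^n}{b_m x^m} = \frac{a_n}{b_m} \lim_{x\to\pm\infty} x^{n-m} =
\begin{cases} \infty & \text{if } n > m, \\ \dfrac{a_n}{b_m} & \text{if } n = m, \\ 0 & \text{if } n < m. \end{cases}$$

For example:

$$\lim_{x\to+\infty} \frac{3x^3 - 2x + 1}{x - x^2} = \lim_{x\to+\infty} \frac{3x^3}{-x^2} = -\infty,$$
$$\lim_{x\to-\infty} \frac{-4x^5 + 2x^3 - 7}{8x^5 - x^4 + 5x} = \lim_{x\to-\infty} \frac{-4x^5}{8x^5} = -\frac{1}{2},$$
$$\lim_{x\to-\infty} \frac{6x^2 - x + 5}{-x^3 + 9} = \lim_{x\to-\infty} \frac{6x^2}{-x^3} = 0.$$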

iv) The function $y = \frac{\sin x}{x}$ becomes indeterminate $\frac{0}{0}$ for $x \to 0$; we proved in part
i), Examples 4.6 that y converges to 1. From this, we can deduce the behaviour
of $y = \frac{1 - \cos x}{x^2}$ as $x \to 0$, another indeterminate form of the type $\frac{0}{0}$. In fact,

$$\lim_{x\to 0} \frac{1 - \cos x}{x^2} = \lim_{x\to 0} \frac{(1 - \cos x)(1 + \cos x)}{x^2 (1 + \cos x)} = \lim_{x\to 0} \frac{1 - \cos^2 x}{x^2} \cdot \lim_{x\to 0} \frac{1}{1 + \cos x}.$$

The fundamental trigonometric equation $\cos^2 x + \sin^2 x = 1$ together with
Theorem 4.10 gives

$$\lim_{x\to 0} \frac{\sin^2 x}{x^2} = \lim_{x\to 0} \left( \frac{\sin x}{x} \right)^2 = \left( \lim_{x\to 0} \frac{\sin x}{x} \right)^2 = 1.$$

The same theorem tells also that the second limit is $\frac{1}{2}$,
so we conclude

$$\lim_{x\to 0} \frac{1 - \cos x}{x^2} = \frac{1}{2}.$$


With these examples we have taken the chance to look at the behaviour of
elementary functions at the boundary points of their domains. For completeness we
gather the most significant limits relative to the elementary functions of Sect. 2.6;
their proofs may be found in Appendix A.2.2, p. 435.


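$\lim_{x\to+\infty} x^\alpha = +\infty$,  $\lim_{x\to 0^+} x^\alpha = 0$   ($\alpha > 0$)

$\lim_{x\to+\infty} x^\alpha = 0$,  $\lim_{x\to 0^+} x^\alpha = +\infty$   ($\alpha < 0$)

$\lim_{x\to\pm\infty} \dfrac{a_n x^n + \ldots + a_1 x + a_0}{b_m x^m + \ldots + b_1 x + b_0} = \dfrac{a_n}{b_m} \lim_{x\to\pm\infty} x^{n-m}$

$\lim_{x\to+\infty} a^x = +\infty$,  $\lim_{x\to-\infty} a^x = 0$   ($a > 1$)

$\lim_{x\to+\infty} a^x = 0$,  $\lim_{x\to-\infty} a^x = +\infty$   ($a < 1$)

$\lim_{x\to+\infty} \log_a x = +\infty$,  $\lim_{x\to 0^+} \log_a x = -\infty$   ($a > 1$)

$\lim_{x\to+\infty} \log_a x = -\infty$,  $\lim_{x\to 0^+} \log_a x = +\infty$   ($a < 1$)

$\lim_{x\to\pm\infty} \sin x$,  $\lim_{x\to\pm\infty} \cos x$,  $\lim_{x\to\pm\infty} \tan x$  do not exist

$\lim_{x\to(\frac{\pi}{2} + k\pi)^{\pm}} \tan x = \mp\infty$,  $\forall k \in \mathbb{Z}$

$\lim_{x\to\pm 1} \arcsin x = \pm\frac{\pi}{2} = \arcsin(\pm 1)$

$\lim_{x\to+1} \arccos x = 0 = \arccos 1$,  $\lim_{x\to-1} \arccos x = \pi = \arccos(-1)$

$\lim_{x\to\pm\infty} \arctan x = \pm\frac{\pi}{2}$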
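The rule for rational functions at infinity depends only on the two leading coefficients and on the degrees n and m. The Python sketch below encodes that case distinction; the function name `rational_limit_at_infinity`, the convention of listing coefficients from the highest degree down, and the restriction to integer coefficients are assumptions made for this illustration.

```python
import math
from fractions import Fraction


def rational_limit_at_infinity(num, den, direction=+1):
    """Limit of P(x)/Q(x) as x -> +inf (direction=+1) or -inf (direction=-1).

    num, den: integer coefficients, highest degree first, with non-zero
    leading entries. Returns a Fraction for a finite limit, otherwise
    +math.inf or -math.inf.
    """
    n, m = len(num) - 1, len(den) - 1
    ratio = Fraction(num[0], den[0])  # a_n / b_m
    if n < m:
        return Fraction(0)
    if n == m:
        return ratio
    # n > m: the quotient behaves like ratio * x**(n - m).
    sign = 1 if ratio > 0 else -1
    if direction < 0 and (n - m) % 2 == 1:
        sign = -sign  # odd power of a negative x flips the sign
    return sign * math.inf


# The three worked examples on rational functions from Examples 4.14 iii):
print(rational_limit_at_infinity([3, 0, -2, 1], [-1, 1, 0], +1))                # -inf
print(rational_limit_at_infinity([-4, 0, 2, 0, 0, -7], [8, -1, 0, 0, 5, 0], -1))  # -1/2
print(rational_limit_at_infinity([6, -1, 5], [-1, 0, 0, 9], -1))                # 0
```

Exact fractions are used for the finite case so that a value such as −1/2 is not blurred by rounding.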


4.1.4 Substitution theorem 


The so-called Substitution theorem is important in itself for theoretical reasons, 
besides providing a very useful method to compute limits. 


Theorem 4.15 Suppose a map f admits limit

$$\lim_{x\to c} f(x) = \ell, \qquad (4.9)$$

finite or not. Let g be defined on a neighbourhood of $\ell$ (excluding possibly the
point $\ell$) and such that

i) if $\ell \in \mathbb{R}$, g is continuous at $\ell$;

ii) if $\ell = +\infty$ or $\ell = -\infty$, the limit $\lim_{y\to\ell} g(y)$ exists, finite or not.

Then the composition $g \circ f$ admits limit for $x \to c$ and

$$\lim_{x\to c} g(f(x)) = \lim_{y\to\ell} g(y). \qquad (4.10)$$


Proof. Set $m = \lim_{y\to\ell} g(y)$ (noting that under i), $m = g(\ell)$). Given any neighbourhood
$I(m)$ of $m$, by i) or ii) there will be a neighbourhood $I(\ell)$ of $\ell$ such
that
$$\forall y \in \operatorname{dom} g, \quad y \in I(\ell) \;\Rightarrow\; g(y) \in I(m).$$

Note that in case i) we can use $I(\ell)$ instead of $I(\ell) \setminus \{\ell\}$ because g is
continuous at $\ell$ (recall (3.7)), while $\ell$ does not belong to $I(\ell)$ for case ii).
With such $I(\ell)$, assumption (4.9) implies the existence of a neighbourhood
$I(c)$ of $c$ with

$$\forall x \in \operatorname{dom} f, \quad x \in I(c) \setminus \{c\} \;\Rightarrow\; f(x) \in I(\ell).$$

Since $x \in \operatorname{dom} g \circ f$ means $x \in \operatorname{dom} f$ plus $y = f(x) \in \operatorname{dom} g$, the previous
two implications now give

$$\forall x \in \operatorname{dom} g \circ f, \quad x \in I(c) \setminus \{c\} \;\Rightarrow\; g(f(x)) \in I(m).$$

But $I(m)$ was arbitrary, so

$$\lim_{x\to c} g(f(x)) = m.$$


Remark 4.16 An alternative condition that yields the same conclusion is the 
following: 


i') if $\ell \in \mathbb{R}$, there is a neighbourhood $I(c)$ of $c$ where $f(x) \ne \ell$ for all $x \ne c$, and
the limit $\lim_{y\to\ell} g(y)$ exists, finite or infinite.

The proof is analogous. □


In case $\ell \in \mathbb{R}$ and g is continuous at $\ell$ (case i)), then $\lim_{y\to\ell} g(y) = g(\ell)$, so (4.10)
reads

$$\lim_{x\to c} g(f(x)) = g\left(\lim_{x\to c} f(x)\right). \qquad (4.11)$$


An imprecise but effective way to put (4.11) into words is to say that a continuous 
function commutes (exchanges places) with the symbol of limit. 


Theorem 4.15 implies that continuity is inherited by composite functions, as 
we discuss hereby. 


Corollary 4.17 Let f be continuous at $x_0$, and define $y_0 = f(x_0)$. Let furthermore
g be defined around $y_0$ and continuous at $y_0$. Then the composite
$g \circ f$ is continuous at $x_0$.


Proof. From (4.11)

$$\lim_{x\to x_0} (g \circ f)(x) = g\left(\lim_{x\to x_0} f(x)\right) = g(f(x_0)) = (g \circ f)(x_0),$$


which is equivalent to the claim. 


A few practical examples will help us understand how the Substitution theorem 
and its corollary are employed. 


Examples 4.18 


i) The map $h(x) = \sin(x^2)$ is continuous on $\mathbb{R}$, being the composition of the
continuous functions $f(x) = x^2$ and $g(y) = \sin y$.


ii) Let us determine
$$\lim_{x\to 0} \frac{\sin(x^2)}{x^2}.$$
Set $f(x) = x^2$ and
$$g(y) = \begin{cases} \dfrac{\sin y}{y} & \text{if } y \ne 0, \\ 1 & \text{if } y = 0. \end{cases}$$
Then $\lim_{x\to 0} f(x) = 0$, and we know that g is continuous at the origin. Thus
$$\lim_{x\to 0} \frac{\sin(x^2)}{x^2} = \lim_{y\to 0} \frac{\sin y}{y} = 1.$$


iii) We study the behaviour of $h(x) = \arctan\left(\frac{1}{x-1}\right)$ around the point 1.
Defining $f(x) = \frac{1}{x-1}$, we have $\lim_{x\to 1^{\pm}} f(x) = \pm\infty$. If we call $g(y) = \arctan y$,
$\lim_{y\to\pm\infty} g(y) = \pm\frac{\pi}{2}$ (see the Table on page 101). Therefore

$$\lim_{x\to 1^{\pm}} \arctan\left(\frac{1}{x-1}\right) = \lim_{y\to\pm\infty} g(y) = \pm\frac{\pi}{2}.$$


iv) Determine
$$\lim_{x\to+\infty} \log \sin\frac{1}{x}.$$

Setting $f(x) = \sin\frac{1}{x}$ has the effect that $\ell = \lim_{x\to+\infty} f(x) = 0$. Note that $f(x) > 0$
for all $x > \frac{1}{\pi}$. With $g(y) = \log y$ we have $\lim_{y\to 0^+} g(y) = -\infty$, so Remark 4.16 yields

$$\lim_{x\to+\infty} \log \sin\frac{1}{x} = \lim_{y\to 0^+} g(y) = -\infty.$$


Remark 4.19 Theorem 4.15 extends easily to cover the case where the role of f
is played by a sequence $a : n \mapsto a_n$ with limit
$$\lim_{n\to\infty} a_n = \ell.$$

Namely, under the same assumptions on g,
$$\lim_{n\to\infty} g(a_n) = \lim_{y\to\ell} g(y).$$
This result is often used to disprove the existence of a limit, in that it provides a
Criterion of non-existence for limits: if two sequences $a : n \mapsto a_n$, $b : n \mapsto b_n$
have the same limit $\ell$ and

$$\lim_{n\to\infty} g(a_n) \ne \lim_{n\to\infty} g(b_n),$$

then g does not admit limit when its argument tends to $\ell$.


For example we can prove, with the aid of the criterion, that $y = \sin x$ has no
limit when $x \to +\infty$: define the sequences $a_n = 2n\pi$ and $b_n = \frac{\pi}{2} + 2n\pi$, $n \in \mathbb{N}$, so
that

$$\lim_{n\to\infty} \sin a_n = \lim_{n\to\infty} 0 = 0, \quad\text{and at the same time}\quad \lim_{n\to\infty} \sin b_n = \lim_{n\to\infty} 1 = 1.$$

Similarly, the function $y = \sin\frac{1}{x}$ has neither left nor right limit for $x \to 0$.
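The criterion of non-existence can be observed numerically for $y = \sin\frac{1}{x}$ as $x \to 0^+$. The Python sketch below uses the two sequences $a_n = \frac{1}{2n\pi}$ and $b_n = \frac{1}{\pi/2 + 2n\pi}$, which are a natural choice made here for illustration (the text leaves the choice to the reader); the function names are likewise hypothetical. Both sequences tend to 0, yet the values of the function along them stay near 0 and near 1 respectively.

```python
import math


def g(x):
    """The function y = sin(1/x), defined for x != 0."""
    return math.sin(1.0 / x)


def a_seq(n):
    """a_n = 1 / (2 n pi): tends to 0+, and sin(1/a_n) = sin(2 n pi) = 0."""
    return 1.0 / (2 * n * math.pi)


def b_seq(n):
    """b_n = 1 / (pi/2 + 2 n pi): tends to 0+, and sin(1/b_n) = 1."""
    return 1.0 / (math.pi / 2 + 2 * n * math.pi)


for n in (1, 10, 100, 1000):
    print(f"n = {n:>4}: a_n = {a_seq(n):.3e}, g(a_n) = {g(a_seq(n)):+.1e}, "
          f"b_n = {b_seq(n):.3e}, g(b_n) = {g(b_seq(n)):.6f}")
```

Up to rounding errors, the second column of values is 0 and the fourth is 1, so no single number can be the right limit of g at 0.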


4.2 More fundamental limits. Indeterminate forms of 
exponential type 


Consider the paramount limit (3.3). Instead of the sequence $a_n = \left(1 + \frac{1}{n}\right)^n$, we
look now at the function of real variable

$$h(x) = \left(1 + \frac{1}{x}\right)^x.$$

It is defined when $1 + \frac{1}{x} > 0$, hence on $(-\infty, -1) \cup (0, +\infty)$. The following result
states that h and the sequence resemble each other closely when x tends to infinity.
Its proof is given in Appendix A.2.3, p. 439.


Property 4.20 The following limit holds

$$\lim_{x\to\pm\infty} \left(1 + \frac{1}{x}\right)^x = e.$$

By manipulating this formula we achieve a series of new fundamental limits. 


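The substitution $y = \frac{x}{a}$, with $a \ne 0$, gives

$$\lim_{x\to\pm\infty} \left(1 + \frac{a}{x}\right)^x = \lim_{y\to\pm\infty} \left(1 + \frac{1}{y}\right)^{ay} = \left[\lim_{y\to\pm\infty} \left(1 + \frac{1}{y}\right)^y\right]^a = e^a.$$

In terms of the variable $y = \frac{1}{x}$ then,

$$\lim_{x\to 0} (1 + x)^{1/x} = \lim_{y\to\pm\infty} \left(1 + \frac{1}{y}\right)^y = e.$$

The continuity of the logarithm together with (4.11) furnish

$$\lim_{x\to 0} \frac{\log_a(1 + x)}{x} = \lim_{x\to 0} \log_a (1 + x)^{1/x} = \log_a \lim_{x\to 0} (1 + x)^{1/x} = \log_a e = \frac{1}{\log a}$$

for any $a > 0$. In particular, taking $a = e$:

$$\lim_{x\to 0} \frac{\log(1 + x)}{x} = 1.$$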


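Note by the way $a^x - 1 = y$ is equivalent to $x = \log_a(1 + y)$, and $y \to 0$ if $x \to 0$.
With this substitution,

$$\lim_{x\to 0} \frac{a^x - 1}{x} = \lim_{y\to 0} \frac{y}{\log_a(1 + y)} = \left[\lim_{y\to 0} \frac{\log_a(1 + y)}{y}\right]^{-1} = \log a. \qquad (4.12)$$

Taking $a = e$ produces

$$\lim_{x\to 0} \frac{e^x - 1}{x} = 1.$$

Eventually, let us set $1 + x = e^y$. Since $y \to 0$ when $x \to 0$,

$$\lim_{x\to 0} \frac{(1 + x)^\alpha - 1}{x} = \lim_{y\to 0} \frac{e^{\alpha y} - 1}{e^y - 1} = \lim_{y\to 0} \frac{e^{\alpha y} - 1}{y} \cdot \frac{y}{e^y - 1}
= \lim_{y\to 0} \frac{(e^\alpha)^y - 1}{y} \cdot \lim_{y\to 0} \frac{y}{e^y - 1} = \log e^\alpha = \alpha \qquad (4.13)$$

for any $\alpha \in \mathbb{R}$.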


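For the reader's convenience, all fundamental limits found so far are gathered
below.

$\lim_{x\to 0} \dfrac{\sin x}{x} = 1$

$\lim_{x\to 0} \dfrac{1 - \cos x}{x^2} = \dfrac{1}{2}$

$\lim_{x\to\pm\infty} \left(1 + \dfrac{a}{x}\right)^x = e^a$   ($a \in \mathbb{R}$)

$\lim_{x\to 0} (1 + x)^{1/x} = e$

$\lim_{x\to 0} \dfrac{\log_a(1 + x)}{x} = \dfrac{1}{\log a}$   ($a > 0$); in particular, $\lim_{x\to 0} \dfrac{\log(1 + x)}{x} = 1$

$\lim_{x\to 0} \dfrac{a^x - 1}{x} = \log a$   ($a > 0$); in particular, $\lim_{x\to 0} \dfrac{e^x - 1}{x} = 1$

$\lim_{x\to 0} \dfrac{(1 + x)^\alpha - 1}{x} = \alpha$   ($\alpha \in \mathbb{R}$)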
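The fundamental limits at $x \to 0$ derived in this section can be checked numerically by evaluating each quotient at a small value of x. The following Python sketch does so; the function name `fundamental_limits`, the sample base `a = 2`, the exponent `alpha = 0.5` and the evaluation point are arbitrary choices for illustration, and agreement to a few digits is of course only a plausibility check.

```python
import math


def fundamental_limits(x, a=2.0, alpha=0.5):
    """Map each expression to (value at x, limit as x -> 0)."""
    return {
        "sin(x)/x": (math.sin(x) / x, 1.0),
        "(1 - cos x)/x^2": ((1.0 - math.cos(x)) / x**2, 0.5),
        "(1 + x)^(1/x)": ((1.0 + x) ** (1.0 / x), math.e),
        "log(1 + x)/x": (math.log1p(x) / x, 1.0),
        "(a^x - 1)/x": ((a**x - 1.0) / x, math.log(a)),
        "((1 + x)^alpha - 1)/x": (((1.0 + x) ** alpha - 1.0) / x, alpha),
    }


for name, (value, limit) in fundamental_limits(1e-4).items():
    print(f"{name:<24} value = {value:.6f}   limit = {limit:.6f}")
```

Taking x much smaller than about 1e-6 is not advisable here: expressions such as 1 − cos x then suffer from cancellation in floating-point arithmetic.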


Let us return to the map $h(x) = \left(1 + \frac{1}{x}\right)^x$. By setting $f(x) = \left(1 + \frac{1}{x}\right)$ and
$g(x) = x$, we can write

$$h(x) = [f(x)]^{g(x)}.$$

In general such an expression may give rise to indeterminate forms for x tending
to a certain c. Suppose f, g are functions defined in a neighbourhood of c, except
possibly at c, and that they admit limit for $x \to c$. Assume moreover $f(x) > 0$
around c, so that h is well defined in a neighbourhood of c (except possibly at c).
To understand h it is convenient to use the identity

$$f(x) = e^{\log f(x)}.$$

From this in fact we obtain

$$h(x) = e^{g(x) \log f(x)}.$$

By continuity of the exponential and (4.11), we have

$$\lim_{x\to c} [f(x)]^{g(x)} = \exp\left(\lim_{x\to c} [g(x) \log f(x)]\right).$$


In other words, $h(x)$ can be studied by looking at the exponent $g(x) \log f(x)$.
An indeterminate form of the latter will thus develop an indeterminate form
of exponential type for $h(x)$. Namely, we might find ourselves in one of these
situations:

i) g tends to $\infty$ and f to 1 (so $\log f$ tends to 0): the exponent is an indeterminate
form of type $\infty \cdot 0$, whence we say that h presents an indeterminate form of
type
$$1^\infty.$$

ii) g and f both tend to 0 (so $\log f$ tends to $-\infty$): once again the exponent is of
type $\infty \cdot 0$, and the function h is said to have an indeterminate form of type
$$0^0.$$

iii) g tends to 0 and f tends to $+\infty$ ($\log f \to +\infty$): the exponent is of type $\infty \cdot 0$,
and h becomes indeterminate of type
$$\infty^0.$$


Examples 4.21 


i) The map $h(x) = \left(1 + \frac{1}{x}\right)^x$ is an indeterminate form of type $1^\infty$ when $x \to \pm\infty$,
whose limit equals e.


ii) The function $h(x) = x^x$, for $x \to 0^+$, is an indeterminate form of type $0^0$. We
shall prove in Chap. 6 that $\lim_{x\to 0^+} x \log x = 0$, therefore $\lim_{x\to 0^+} h(x) = 1$.


iii) The function $h(x) = x^{1/x}$ is for $x \to +\infty$ an indeterminate form of type $\infty^0$.
Substituting $y = \frac{1}{x}$, and recalling that $\log\frac{1}{y} = -\log y$, we obtain
$\lim_{x\to+\infty} \frac{\log x}{x} = -\lim_{y\to 0^+} y \log y = 0$, hence $\lim_{x\to+\infty} h(x) = 1$.


When dealing with $h(x) = [f(x)]^{g(x)}$, a rather common mistake, with tragic
consequences, is to calculate first the limit of f and/or g, substitute the map
with this value and compute the limit of the expression thus obtained. This is to
emphasize that it might be incorrect to calculate the limit for $x \to c$ of the
indeterminate form $h(x) = [f(x)]^{g(x)}$ by finding first

$$m = \lim_{x\to c} g(x), \quad\text{and from this proceed to}\quad \lim_{x\to c} [f(x)]^m.$$

Equally incorrect might be to determine

$$\lim_{x\to c} \ell^{\,g(x)}, \quad\text{already knowing}\quad \ell = \lim_{x\to c} f(x).$$

For example, suppose we are asked to find the limit of $h(x) = \left(1 + \frac{1}{x}\right)^x$ for
$x \to \pm\infty$; we might think of finding first $\ell = \lim_{x\to\pm\infty} \left(1 + \frac{1}{x}\right) = 1$ and from this
$\lim_{x\to\pm\infty} 1^x = \lim_{x\to\pm\infty} 1 = 1$. This would lead us to believe, wrongly, that h converges
to 1, in spite of the fact that the correct limit is e.
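The pitfall just described is easy to observe numerically. In the Python sketch below (function names `h` and `exponent` are hypothetical), the base $1 + \frac{1}{x}$ visibly approaches 1, while the exponent $g(x) \log f(x) = x \log\left(1 + \frac{1}{x}\right)$ approaches 1 rather than 0, so that $h(x) = e^{g(x) \log f(x)}$ approaches e. This is a floating-point illustration of the identity used in the text, not a proof.

```python
import math


def h(x):
    """h(x) = (1 + 1/x)**x, evaluated directly."""
    return (1.0 + 1.0 / x) ** x


def exponent(x):
    """The exponent g(x) * log f(x) = x * log(1 + 1/x)."""
    return x * math.log1p(1.0 / x)


for x in (10.0, 1e3, 1e6):
    base = 1.0 + 1.0 / x  # tends to 1, which tempts one to answer 1**x = 1
    print(f"x = {x:>9.0f}  base = {base:.6f}  exponent = {exponent(x):.6f}  "
          f"h(x) = {h(x):.6f}  exp(exponent) = {math.exp(exponent(x)):.6f}")

print(f"e = {math.e:.6f}")
```

Replacing the base by its limit 1 before taking the power discards exactly the information carried by the exponent, which is why the naive answer 1 is wrong.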


4.3 Global features of continuous maps 


Hitherto the focus has been on several local properties of functions, whether in the 
neighbourhood of a real point or a point at infinity, and limits have been discussed 
in that respect. Now we turn our attention to continuous functions defined on a 
real interval, and establish properties of global nature, i.e., those relative to the 
behaviour on the entire domain. 

Let us start with a plain definition. 


Definition 4.22 A zero of a real-valued function f is a point $x_0 \in \operatorname{dom} f$
at which the function vanishes.


For instance, the zeroes of $y = \sin x$ are the multiples of $\pi$, i.e., the elements of
the set $\{m\pi \mid m \in \mathbb{Z}\}$.


The problem of solving an equation like 


f(x) = 0 


is equivalent to determining the zeroes of the function $y = f(x)$. That is why it
becomes crucial to have methods, both analytical and numerical, that allow one to
find the zeroes of a function, or at least their approximate position.


A simple condition to have a zero inside an interval goes as follows. 


Theorem 4.23 (Existence of zeroes) Let f be a continuous map on a
closed, bounded interval $[a, b]$. If $f(a) f(b) < 0$, i.e., if the images of the end-points
under f have different signs, f admits a zero within the open interval $(a, b)$.
If moreover f is strictly monotone on $[a, b]$, the zero is unique.


Figure 4.5. Theorem of existence of zeroes 


Proof. Throughout the proof we shall use properties of sequences, for which we
refer to the following Sect. 5.4. Assuming $f(a) < 0 < f(b)$ is not restrictive.
Define $a_0 = a$, $b_0 = b$ and let $c_0 = \frac{a_0 + b_0}{2}$ be the middle point of the
interval $[a_0, b_0]$. There are three possibilities for $f(c_0)$. If $f(c_0) = 0$, the
point $x_0 = c_0$ is a zero and the proof ends. If $f(c_0) > 0$, we set $a_1 = a_0$ and
$b_1 = c_0$, so to consider the left half of the original interval. If $f(c_0) < 0$,
let $a_1 = c_0$, $b_1 = b_0$ and take the right half of $[a_0, b_0]$ this time. In either
case we have generated a sub-interval $[a_1, b_1] \subset [a_0, b_0]$ such that

$$f(a_1) < 0 < f(b_1) \quad\text{and}\quad b_1 - a_1 = \frac{b_0 - a_0}{2}.$$

Repeating the procedure we either reach a zero of f after a finite number
of steps, or we build a sequence of nested intervals $[a_n, b_n]$ satisfying:

$$[a_0, b_0] \supset [a_1, b_1] \supset \ldots \supset [a_n, b_n] \supset \ldots,$$
$$f(a_n) < 0 < f(b_n) \quad\text{and}\quad b_n - a_n = \frac{b_0 - a_0}{2^n}$$

(the rigorous proof of the existence of such a sequence relies on the Principle
of Induction; details are provided in Appendix A.1, p. 429). In this
second situation, we claim that there is a unique point $x_0$ belonging to
every interval of the sequence, and this point is a zero of f. For this,
observe that the sequences $\{a_n\}$ and $\{b_n\}$ satisfy

$$a_0 \le a_1 \le \ldots \le a_n \le \ldots \le b_n \le \ldots \le b_1 \le b_0.$$

Therefore $\{a_n\}$ is monotone increasing and bounded, while $\{b_n\}$ is monotone
decreasing and bounded. By Theorem 3.9 there exist $x_0^-, x_0^+ \in [a, b]$
such that

$$\lim_{n\to\infty} a_n = x_0^- \quad\text{and}\quad \lim_{n\to\infty} b_n = x_0^+.$$

On the other hand, Example 5.18 i) tells

$$x_0^+ - x_0^- = \lim_{n\to\infty} (b_n - a_n) = \lim_{n\to\infty} \frac{b - a}{2^n} = 0,$$

so $x_0^- = x_0^+$. Let $x_0$ denote this number. Since f is continuous, and using
the Substitution theorem (Theorem 9, p. 138), we have

$$\lim_{n\to\infty} f(a_n) = \lim_{n\to\infty} f(b_n) = f(x_0).$$

But $f(a_n) < 0 < f(b_n)$, so the First comparison theorem (Theorem 4,
p. 137) for $\{f(a_n)\}$ and $\{f(b_n)\}$ gives

$$\lim_{n\to\infty} f(a_n) \le 0 \quad\text{and}\quad \lim_{n\to\infty} f(b_n) \ge 0.$$

As $0 \le f(x_0) \le 0$, we obtain $f(x_0) = 0$.
In conclusion, if f is strictly monotone on $[a, b]$ it must be injective by
Proposition 2.8, which forces the zero to be unique. □


Some comments on this theorem might prove useful. We remark first that 
without the hypothesis of continuity on the closed interval [a,b], the condition 
f(a) f(b) < 0 would not be enough to ensure the presence of a zero. The function 


$$f : [0, 1] \to \mathbb{R}, \qquad f(x) = \begin{cases} -1 & \text{for } x = 0, \\ +1 & \text{for } 0 < x \le 1 \end{cases}$$

takes values of discordant sign at the end-points but never vanishes; it has a jump
point at $x = 0$.

Secondly, $f(a) f(b) < 0$ is a sufficient requirement only, and not a necessary one,
to have a zero. The continuous map $f(x) = (2x - 1)^2$ vanishes on $[0, 1]$ despite
being positive at both ends of the interval.


Thirdly, the halving procedure used in the proof can be transformed into an
algorithm of approximation, known in Numerical Analysis under the name Bisection
method.

A first application of the Theorem of existence of zeroes comes next. 


Example 4.24 


The function $f(x) = x^4 + x^3 - 1$ on $[0, 1]$ is a polynomial, hence continuous.
As $f(0) = -1$ and $f(1) = 1$, f must vanish somewhere on $[0, 1]$. The zero is
unique because the map is strictly increasing (it is sum of the strictly increasing
functions $y = x^4$ and $y = x^3$, and of the constant function $y = -1$).
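The halving procedure from the proof of Theorem 4.23 translates almost literally into code. The Python sketch below is one minimal way to write the Bisection method and applies it to the polynomial of this example; the function name `bisect`, the tolerance `tol` and the iteration cap `max_iter` are choices made here for illustration, not prescriptions of the text.

```python
def bisect(f, a, b, tol=1e-10, max_iter=200):
    """Approximate a zero of a continuous f on [a, b], given f(a) * f(b) < 0.

    At each step the interval is halved and the half on which f still
    changes sign is kept, as in the proof of the Theorem of existence
    of zeroes.
    """
    fa, fb = f(a), f(b)
    if fa * fb >= 0:
        raise ValueError("f(a) and f(b) must have opposite signs")
    for _ in range(max_iter):
        c = (a + b) / 2
        fc = f(c)
        # Stop on an exact zero or when the half-width is below tol.
        if fc == 0 or (b - a) / 2 < tol:
            return c
        if (fc < 0) == (fa < 0):
            a, fa = c, fc  # sign change lies in the right half [c, b]
        else:
            b = c          # sign change lies in the left half [a, c]
    return (a + b) / 2


def f(x):
    return x**4 + x**3 - 1


root = bisect(f, 0.0, 1.0)
print(f"zero of x^4 + x^3 - 1 on [0, 1]: {root:.8f}, f(root) = {f(root):.2e}")
```

Since the interval length is halved at every step, about 34 iterations suffice to reach the tolerance 1e-10 starting from an interval of length 1.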


Our theorem can be generalised usefully as follows. 


Corollary 4.25 Let f be continuous on the interval I and suppose it admits
non-zero limits (finite or infinite) that are different in sign for x tending to
the end-points of I. Then f has a zero in I, which is unique if f is strictly
monotone on I.


Proof. The result is a consequence of Theorems 4.2 and 4.23 (Existence of zeroes). 
For more details see Appendix A.3.2, p. 444. 


Example 4.26 


Consider the map $f(x) = x + \log x$, defined on $I = (0, +\infty)$. The functions $y = x$
and $y = \log x$ are continuous and strictly increasing on I, and so is f. Since
$\lim_{x\to 0^+} f(x) = -\infty$ and $\lim_{x\to+\infty} f(x) = +\infty$, f has exactly one zero on its domain.


Corollary 4.27 Consider f and g continuous maps on the closed bounded
interval $[a, b]$. If $f(a) < g(a)$ and $f(b) > g(b)$, there exists at least one point
$x_0$ in the open interval $(a, b)$ with

$$f(x_0) = g(x_0). \qquad (4.14)$$


Proof. Consider the auxiliary function $h(x) = f(x) - g(x)$, which is continuous in
$[a, b]$ as difference of continuous maps. By assumption, $h(a) = f(a) - g(a) < 0$
and $h(b) = f(b) - g(b) > 0$. So, h satisfies the hypotheses of the Theorem of existence of
zeroes and admits in $(a, b)$ a point $x_0$ such that $h(x_0) = 0$. But this is
precisely (4.14).

Note that if h is strictly increasing on $[a, b]$, the solution of (4.14) has to
be unique in the interval. □




Figure 4.6. Illustration of Corollary 4.27 


Example 4.28 
Solve the equation
$$\cos x = x. \qquad (4.15)$$

For any real x, $-1 \le \cos x \le 1$, so the equation cannot be solved when $x < -1$ or
$x > 1$. Similarly, no solution exists on $[-1, 0)$, because $\cos x$ is positive while x is
negative on that interval. Therefore the solutions, if any, must hide in $[0, 1]$: there
the functions $f(x) = x$ and $g(x) = \cos x$ are continuous and $f(0) = 0 < 1 = g(0)$,
$f(1) = 1 > \cos 1 = g(1)$ (cosine is 1 only for multiples of $2\pi$). The above corollary
implies that equation (4.15) has a solution in $(0, 1)$. There can be no other
solution, for f is strictly increasing and g strictly decreasing on $[0, 1]$, making
$h(x) = f(x) - g(x)$ strictly increasing. □


When one of the functions is a constant, the corollary implies this result. 


Theorem 4.29 (Intermediate value theorem) If a function f is continuous
on the closed and bounded interval $[a, b]$, it assumes all values between
$f(a)$ and $f(b)$.


Proof. When $f(a) = f(b)$ the statement is trivial, so assume first $f(a) < f(b)$.
Call z an arbitrary value between $f(a)$ and $f(b)$ and define the constant
map $g(x) = z$. From $f(a) < z < f(b)$ we have $f(a) < g(a)$ and $f(b) > g(b)$.
Corollary 4.27, applied to f and g in the interval $[a, b]$, yields a point $x_0$
in $[a, b]$ such that $f(x_0) = g(x_0) = z$.
If $f(a) > f(b)$, we just swap the roles of f and g. □


The Intermediate value theorem has, among its consequences, the remarkable 
fact that a continuous function maps intervals to intervals. This is the content of 
the next result. 




Figure 4.7. Intermediate value theorem 


Corollary 4.30 Let f be continuous on an interval I. The range $f(I)$ of I
under f is an interval delimited by $\inf_I f$ and $\sup_I f$.


Proof. A subset of $\mathbb{R}$ is an interval if and only if it contains the interval $[\alpha, \beta]$ as
subset, for any two of its points $\alpha < \beta$.
Let then $y_1 < y_2$ be points of $f(I)$. There exist in I two (necessarily distinct)
pre-images $x_1$ and $x_2$, i.e., $f(x_1) = y_1$, $f(x_2) = y_2$. If $J \subseteq I$ denotes
the closed interval between $x_1$ and $x_2$, we need only to apply the Intermediate
value theorem to f restricted to J, which yields $[y_1, y_2] \subseteq f(J) \subseteq f(I)$.
The range $f(I)$ is then an interval, and according to Definition 2.3 its
end-points are $\inf_I f$ and $\sup_I f$. □


Either one of $\inf_I f$, $\sup_I f$ may be finite or infinite, and may or may not be an
element of the interval itself. If, say, $\inf_I f$ belongs to the range, the function
admits minimum on I (and the same for $\sup_I f$).


In case I is open or half-open, its image $f(I)$ can be an interval of any kind. Let
us see some examples. Regarding $f(x) = \sin x$ on the open bounded $I = (-\frac{\pi}{2}, \frac{\pi}{2})$,
the image $f(I) = (-1, 1)$ is open and bounded. Yet under the same map, the image
of the open bounded set $(0, 2\pi)$ is $[-1, 1]$, bounded but closed. Take now $f(x) = \tan x$:
it maps the bounded interval $(-\frac{\pi}{2}, \frac{\pi}{2})$ to the unbounded one $(-\infty, +\infty)$.
Simple examples can be built also for unbounded I.
But if I is a closed bounded interval, its image under a continuous map cannot
be anything but a closed bounded interval. More precisely, the following fundamental
result holds, whose proof is given in Appendix A.3.2, p. 443.


Theorem 4.31 (Weierstrass) A continuous map f on a closed and bounded
interval $[a, b]$ is bounded and admits minimum and maximum

$$m = \min_{x \in [a, b]} f(x) \quad\text{and}\quad M = \max_{x \in [a, b]} f(x).$$

Consequently,
$$f([a, b]) = [m, M].$$


Figure 4.8. The Theorem of Weierstrass 


In conclusion to this section, we present two results about invertibility (their
proofs may be found in Appendix A.3.2, p. 445). We saw in Sect. 2.4 that a strictly
monotone function is also one-to-one (invertible), and in general the opposite
implication does not hold. Nevertheless, when speaking of continuous functions the
notions of strict monotonicity and injectivity coincide. Moreover, the inverse
function is continuous on its domain of definition.


Theorem 4.32 A continuous function f on an interval I is one-to-one if 
and only if it is strictly monotone. 


Theorem 4.33 Let f be continuous and invertible on an interval I. Then
the inverse $f^{-1}$ is continuous on the interval $J = f(I)$.


Theorem 4.33 guarantees, by the way, the continuity of the inverse trigonometric
functions $y = \arcsin x$, $y = \arccos x$ and $y = \arctan x$ on their domains, and
of the logarithm $y = \log_a x$ on $\mathbb{R}_+$ as well, as inverse of the exponential $y = a^x$.
These facts were actually already known from Proposition 3.20.




Figure 4.9. Graph of a continuous invertible map (left) and its inverse (right) 


4.4 Exercises 


1. Compute the following limits using the Comparison theorems: 


; cos x : . 
a) dim ar: b) dim (/z@ + sin z) 
lm 24 —sing ae [x] 
a—+—oo 32+ coszx r++oo L 
oo, = il | g£-—tane 
e) lim sing - sin — lim 3 
2. Determine the limits: 
4 3 
— “2 — 22° + 5a : xr+3 
a) be 2 — 2 ») ae x? —2r4+5 
. e+aettea : on? + 5a = 7 
D2 > Le d) ili a, ae 
r>—o 274-24 +3 x—++oo 544 — 24 +3 


e+1 
ia Oy). ean 
é-1a/6pe 8 Loe eo ed 
g) im (Vz+1- V2) h) lim Ete 
i) lim (¥e+1—- ¥e-1) Fete oe 
L—>—00 a>—-coo 47 4+2 


3. Relying on the fundamental limits, compute: 


sin* x . xztanx 
b) lim ———— 
r>0 2 x0 1—cosz 


4, 


5. 




_ sin2x —sin3z 
im ——— 


x0 Ax 


tan x — sinz 


ki 

¢) 6 x3 
COs ar 

_ 1-<z 
cosx +1 


i) lim —— 
zn cos3x+1 
Calculate: 


2 


in 

Qe2% — J 

li —— 
jin, 

a 


Compute the limits: 


a) lim 


i) i 1 1 
i im = 
e<>0\etanx xsing 


lim «(2+ sin) 


t—+00 


cos(tan x) — 1 
m ee 
x—0 tan x 
sinz —1 
e+8 (F —2)° 
V1l+tanz—/1—tanz 
m ——EEEEESSs °C" 


r—0 sin x 


r e2t _] 
1m 
zr 0 e8t — ] 


e” 


lim 


at—+too et — ] 


. loge 
lim 
z->l1le*™ —e 


r xt+1 
in —— 
a>-1 Woe +17—2 


lim a ae 7 eciaelad 


37 —3 ” 
lim ——_——— 


: . ee 
lim ze” sin |e” sin — 
xL—-+00 rT 


lim ae™"* 
«—>— oo 


6. Determine the domain of the functions below and their limit behaviour at the 


end-points of the domain: 


eo = 2743 
a = v2 4+ 3242 


= log [1 + exp ( 


¢? 4-1 


»)| f@= =o 


d) f(«) = Vxe™ 


2 




4.4.1 Solutions 


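1. Limits:

a) 0; b) $+\infty$.

c) We have
$$\lim_{x\to-\infty} \frac{2x - \sin x}{3x + \cos x} = \lim_{x\to-\infty} \frac{x\left(2 - \frac{\sin x}{x}\right)}{x\left(3 + \frac{\cos x}{x}\right)} = \frac{2}{3}$$
because $\lim_{x\to-\infty} \frac{\sin x}{x} = \lim_{x\to-\infty} \frac{\cos x}{x} = 0$ by Corollary 4.7.

d) From $[x] \le x < [x] + 1$ (Example 2.1 vii)) one deduces straightaway $x - 1 < [x] \le x$, whence
$$\frac{x - 1}{x} < \frac{[x]}{x} \le 1$$
for $x > 0$. Therefore, the Second comparison theorem 4.5 gives
$$\lim_{x\to+\infty} \frac{[x]}{x} = 1.$$

e) 0.

f) First of all $f(x) = \frac{x - \tan x}{x^2}$ is an odd map, so $\lim_{x\to 0^+} f(x) = -\lim_{x\to 0^-} f(x)$. Let
now $0 < x < \frac{\pi}{2}$. From
$$\sin x < x < \tan x$$
(see Example 4.6 i) for a proof) it follows
$$\sin x - \tan x < x - \tan x < 0, \quad\text{that is,}\quad \frac{\sin x - \tan x}{x^2} < \frac{x - \tan x}{x^2} < 0.$$
Secondly,
$$\lim_{x\to 0^+} \frac{\sin x - \tan x}{x^2} = \lim_{x\to 0^+} \frac{\sin x\,(\cos x - 1)}{x^2 \cos x} = \lim_{x\to 0^+} \frac{\sin x}{\cos x} \cdot \frac{\cos x - 1}{x^2} = 0.$$
Thus the Second comparison theorem 4.5 makes us conclude that
$$\lim_{x\to 0^+} \frac{x - \tan x}{x^2} = 0,$$
therefore the required limit is 0.


2. Limits:
a) $-5$; b) 0.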


118 4 Limits and continuity I 


c) Simple algebraic operations give 


. et+art+e a(l+24+3 
lim = lim 7 = lm = =-o 
zt 3—oo Qn? — +3 L4-0o 7 ( = =) ~L—>—0o 
d) 2 
e) Rationalising the denominator we see
$$\lim_{x\to-1}\frac{x+1}{\sqrt{6x^2+3}+3x} = \lim_{x\to-1}\frac{(x+1)\left(\sqrt{6x^2+3}-3x\right)}{6x^2+3-9x^2} = \lim_{x\to-1}\frac{(x+1)\left(\sqrt{6x^2+3}-3x\right)}{3(1-x)(1+x)} = 1.$$
f) Use the relation $a^3-b^3 = (a-b)(a^2+ab+b^2)$ in
$$\lim_{x\to 2}\frac{\sqrt[3]{10-x}-2}{x-2} = \lim_{x\to 2}\frac{10-x-8}{(x-2)\left(\sqrt[3]{(10-x)^2}+2\sqrt[3]{10-x}+4\right)} = \lim_{x\to 2}\frac{-1}{\sqrt[3]{(10-x)^2}+2\sqrt[3]{10-x}+4} = -\frac{1}{12}.$$
g) 0; h) 1; i) 0. 
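ℓ) We have
$$\lim_{x\to-\infty}\frac{\sqrt{2x^2+3}}{4x+2} = \lim_{x\to-\infty}\frac{|x|\sqrt{2+\frac{3}{x^2}}}{x\left(4+\frac{2}{x}\right)} = \frac{\sqrt{2}}{4}\lim_{x\to-\infty}\frac{-x}{x} = -\frac{\sqrt{2}}{4}.$$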
3. Limits: 
a) 0; b) 2. 
c) We manipulate the expression so as to obtain a fundamental limit:
$$\lim_{x\to 0}\frac{\sin 2x-\sin 3x}{4x} = \lim_{x\to 0}\frac{\sin 2x}{4x} - \lim_{x\to 0}\frac{\sin 3x}{4x} = \frac{1}{2}-\frac{3}{4} = -\frac{1}{4}.$$
d) We use the cosine’s fundamental limit: 
1- 1— 1 1 1 
ie ON stip ONE ig, ae ait poe 
x—0t Qn x—0F x x—>0t+ 24 2 230+ 2a 
e) 5. 
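f) Putting $y = \tan x$ and substituting,
$$\lim_{x\to 0}\frac{\cos(\tan x)-1}{\tan x} = \lim_{y\to 0}\frac{\cos y-1}{y} = \lim_{y\to 0}\frac{\cos y-1}{y^2}\cdot y = 0.$$
g) Letting $y = 1-x$ transforms the limit into:
$$\lim_{x\to 1}\frac{\cos\frac{\pi}{2}x}{1-x} = \lim_{y\to 0}\frac{\cos\frac{\pi}{2}(1-y)}{y} = \lim_{y\to 0}\frac{\sin\frac{\pi}{2}y}{y} = \frac{\pi}{2}.$$
h) 3; i) 3.
ℓ) One has
$$\lim_{x\to 0}\frac{\sqrt{1+\tan x}-\sqrt{1-\tan x}}{\sin x} = \lim_{x\to 0}\frac{1+\tan x-1+\tan x}{\sin x\left(\sqrt{1+\tan x}+\sqrt{1-\tan x}\right)} = \frac{1}{2}\lim_{x\to 0}\frac{2\tan x}{\sin x} = \lim_{x\to 0}\frac{1}{\cos x} = 1.$$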
4. Limits: 
a) logs b) . 


c) By defining $y = x-e$ we recover a known fundamental limit:
$$\lim_{x\to e}\frac{\log x-1}{x-e} = \lim_{y\to 0}\frac{\log(y+e)-1}{y} = \lim_{y\to 0}\frac{\log e\,(1+y/e)-1}{y} = \lim_{y\to 0}\frac{\log(1+y/e)}{y} = \frac{1}{e}.$$
Another possibility is to set $z = x/e$:
$$\lim_{x\to e}\frac{\log x-1}{x-e} = \lim_{z\to 1}\frac{\log(ez)-1}{e(z-1)} = \frac{1}{e}\lim_{z\to 1}\frac{\log z}{z-1} = \frac{1}{e}.$$
ay. 
e) We have 
20 25° 1 1 
lim 2 = lm ees aes is 
20+ 22 x—Ot 22 
22 __ 1 1 
= lim 2° 4+ lim — =24+ lim —=+00. 
20+ 22x z30+t 2x z—>0+ 27 
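f) Substitute $y = x-1$, so that
$$\lim_{x\to 1}\frac{\log x}{e^x-e} = \lim_{x\to 1}\frac{\log x}{e\left(e^{x-1}-1\right)} = \lim_{y\to 0}\frac{\log(1+y)}{e\left(e^y-1\right)} = \frac{1}{e}\lim_{y\to 0}\frac{\log(1+y)}{y}\cdot\frac{y}{e^y-1} = \frac{1}{e}.$$
h) The new variable $y = x+1$ allows us to recognise (4.13), so
$$\lim_{x\to-1}\frac{x+1}{\sqrt[4]{x+17}-2} = \lim_{y\to 0}\frac{y}{\sqrt[4]{y+16}-2} = \lim_{y\to 0}\frac{y}{2\left(\sqrt[4]{1+\frac{y}{16}}-1\right)} = \frac{16}{2}\lim_{y\to 0}\frac{y/16}{\sqrt[4]{1+\frac{y}{16}}-1} = 8\cdot 4 = 32.$$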
5. Limits 
a) 5. 
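b) We have
$$\lim_{x\to 0}\frac{e^{x}-e^{-x}}{\sin x} = \lim_{x\to 0}\frac{e^{-x}\left(e^{2x}-1\right)}{\sin x} = \lim_{x\to 0} e^{-x}\cdot\frac{e^{2x}-1}{2x}\cdot\frac{2x}{\sin x} = 2.$$
c) One has
$$\lim_{x\to 0}\left(\operatorname{cotan} x-\frac{1}{\sin x}\right) = \lim_{x\to 0}\frac{\cos x-1}{\sin x} = \lim_{x\to 0}\frac{\cos x-1}{x}\cdot\frac{x}{\sin x} = 0.$$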


Now define $y = \dfrac{1}{x+3}$, and substitute $x = \dfrac{1}{y}-3$ at the exponent:
$$L = \lim_{y\to 0^+}\left(\frac{1}{y}-5\right)\log(1-4y) = \lim_{y\to 0^+}\left(\frac{\log(1-4y)}{y}-5\log(1-4y)\right) = -4.$$
The required limit equals $e^{-4}$.
f) e; g) 2/5. 
h) We have
$$\lim_{x\to-\infty}\frac{3^{x}-3^{-x}}{3^{x}+3^{-x}} = \lim_{x\to-\infty}\frac{3^{-x}\left(3^{2x}-1\right)}{3^{-x}\left(3^{2x}+1\right)} = -1.$$
i) —}. 


ℓ) Start by multiplying numerator and denominator by the same function:
$$\lim_{x\to+\infty} x\,e^{x}e^{-x}\sin\frac{2}{x}\cdot\frac{\sin\left(e^{-x}\sin\frac{2}{x}\right)}{e^{-x}\sin\frac{2}{x}} = \lim_{x\to+\infty} x\sin\frac{2}{x}\cdot\lim_{x\to+\infty}\frac{\sin\left(e^{-x}\sin\frac{2}{x}\right)}{e^{-x}\sin\frac{2}{x}} = L_1\cdot L_2.$$
Now put $y = \frac{1}{x}$ in the first factor to get
$$L_1 = \lim_{y\to 0^+}\frac{\sin 2y}{y} = 2;$$
next, let $t = e^{-x}\sin\frac{2}{x}$. Since $t \to 0$ for $x \to +\infty$, by Corollary 4.7, the second factor is
$$L_2 = \lim_{t\to 0}\frac{\sin t}{t} = 1,$$
and eventually the limit is 2.

m) The fact that $-1 \le \sin x \le 1$ implies $1 \le 2+\sin x \le 3$, so $x \le x(2+\sin x)$ when $x > 0$. Since $\lim_{x\to+\infty} x = +\infty$, the Second comparison theorem 4.8 gives $+\infty$ for an answer.

n) $-\infty$.


6. Domains and limits: 


a) 


b) 


c) 


d) 


dom f = R \ {-2,-1}, 


lim fie) =a00, lim f(a) bos; lim. (4) =e 
ro —Qs e>—1= LOCO 


The function is defined on the entire R and 


_ ao. & oe. ev 
BT) Oe pt Tat athe gt = T° 


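This function makes sense when $x \neq 0$ (because $1+\exp\left(\frac{x^2+1}{x}\right) > 0$ for any non-zero $x$). As for the limits:
$$\lim_{x\to-\infty} f(x) = \log\lim_{x\to-\infty}\left(1+\exp\left(\frac{x^2+1}{x}\right)\right) = \log 1 = 0,$$
$$\lim_{x\to+\infty} f(x) = \log\lim_{x\to+\infty}\left(1+\exp\left(\frac{x^2+1}{x}\right)\right) = +\infty,$$
$$\lim_{x\to 0^-} f(x) = \log\lim_{x\to 0^-}\left(1+\exp\left(\frac{x^2+1}{x}\right)\right) = \log 1 = 0,$$
$$\lim_{x\to 0^+} f(x) = \log\lim_{x\to 0^+}\left(1+\exp\left(\frac{x^2+1}{x}\right)\right) = +\infty.$$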


dom f = R; iim f(2)=0. 


5 


Local comparison of functions. Numerical 
sequences and series 


In the first part of this chapter we learn how to compare the behaviour of two 
functions in the neighbourhood of a point. To this aim, we introduce suitable 
symbols — known as Landau symbols — that make the description of the possible 
types of behaviour easier. Of particular importance is the comparison between 
functions tending to 0 or ∞.

In the second part, we revisit some results on limits which we discussed in
general for functions, and adapt them to the case of sequences. We present specific
techniques for the analysis of the limiting behaviour of sequences. Finally, numerical
series are introduced and the main tools for the study of their convergence are
provided.


5.1 Landau symbols 


As customary by now, we denote by $c$ one of the symbols $x_0$ (real number), $x_0^+$, $x_0^-$, or $+\infty$, $-\infty$. By 'neighbourhood of $c$' we mean a neighbourhood, as previously defined, of one of these symbols.

Let $f$ and $g$ be two functions defined in a neighbourhood of $c$, with the possible exception of the point $c$ itself. Let also $g(x) \neq 0$ for $x \neq c$. Assume the limit
$$\lim_{x\to c}\frac{f(x)}{g(x)} = \ell \tag{5.1}$$
exists, finite or not. We introduce the following definition.


Definition 5.1 If $\ell$ is finite, we say that $f$ is controlled by $g$ for $x$ tending to $c$, and we shall use the notation
$$f = O(g), \qquad x \to c,$$
read as '$f$ is big o of $g$ for $x$ tending to $c$'.


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_5, 
© Springer International Publishing Switzerland 2015 




This property can be made more precise by distinguishing three cases: 


a) If $\ell$ is finite and non-zero, we say that $f$ has the same order of magnitude as $g$ (or is of the same order of magnitude) for $x$ tending to $c$; if so, we write
$$f \asymp g, \qquad x \to c.$$
As sub-case we have:

b) If $\ell = 1$, we call $f$ equivalent to $g$ for $x$ tending to $c$; in this case we use the notation
$$f \sim g, \qquad x \to c.$$

c) Finally, if $\ell = 0$, we say that $f$ is negligible with respect to $g$ when $x$ goes to $c$; for this situation the symbol
$$f = o(g), \qquad x \to c,$$
will be used, spoken '$f$ is little o of $g$ for $x$ tending to $c$'.

Not included in the previous definition is the case in which $\ell$ is infinite. But in such a case
$$\lim_{x\to c}\frac{g(x)}{f(x)} = \frac{1}{\ell} = 0,$$
so we can say that $g = o(f)$ for $x \to c$.
The symbols $O$, $\asymp$, $\sim$, $o$ are called Landau symbols.
Remark 5.2 The Landau symbols can be defined under more general assump- 
tions than those considered at present, i.e., the mere existence of the limit (5.1). 


For instance the expression $f = O(g)$ as $x \to c$ could be extended to mean that there is a constant $C > 0$ such that in a suitable neighbourhood $I$ of $c$
$$|f(x)| \le C\,|g(x)|, \qquad \forall x \in I,\ x \neq c.$$


The given definition is nevertheless sufficient for our purposes. 


Examples 5.3 


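i) Keeping in mind Examples 4.6, we have
$$\sin x \sim x,\ x \to 0, \quad\text{in fact}\quad \lim_{x\to 0}\frac{\sin x}{x} = 1,$$
$$\sin x = o(x),\ x \to +\infty, \quad\text{since}\quad \lim_{x\to+\infty}\frac{\sin x}{x} = 0.$$
ii) We have $\sin x = o(\tan x)$, $x \to \frac{\pi}{2}$, since
$$\lim_{x\to\frac{\pi}{2}}\frac{\sin x}{\tan x} = \lim_{x\to\frac{\pi}{2}}\cos x = 0.$$
iii) One has $\cos x \asymp 2x-\pi$, $x \to \frac{\pi}{2}$, because
$$\lim_{x\to\frac{\pi}{2}}\frac{\cos x}{2x-\pi} = \lim_{t\to 0}\frac{\cos\left(t+\frac{\pi}{2}\right)}{2t} = -\lim_{t\to 0}\frac{\sin t}{2t} = -\frac{1}{2}.$$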
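The three relations in Examples 5.3 can be sanity-checked numerically by tabulating the quotient f(x)/g(x) along points approaching c. The following is only a sketch: the helper name `ratio` and the sample points are illustrative choices, and a table of values suggests a limit without proving it.

```python
import math

def ratio(f, g, x):
    """Return f(x)/g(x), the quotient whose limit defines the Landau symbols."""
    return f(x) / g(x)

def identity(x):
    return x

# Points approaching 0 and points growing towards +infinity (arbitrary choices).
near_zero = [10.0 ** (-k) for k in range(1, 6)]
far_out = [10.0 ** k for k in range(1, 6)]

# sin x ~ x as x -> 0: the quotient should approach 1.
equiv = [ratio(math.sin, identity, x) for x in near_zero]

# sin x = o(x) as x -> +infinity: the quotient should approach 0.
negligible = [ratio(math.sin, identity, x) for x in far_out]

# cos x has the same order as 2x - pi as x -> pi/2: the quotient approaches -1/2.
half_pi = math.pi / 2
same_order = [ratio(math.cos, lambda x: 2 * x - math.pi, half_pi + h) for h in near_zero]

for label, values in [("sin x / x near 0", equiv),
                      ("sin x / x near +inf", negligible),
                      ("cos x / (2x - pi) near pi/2", same_order)]:
    print(label, values)
```

A quotient settling near 1 points to equivalence, near another non-zero number to equal order of magnitude, and near 0 to negligibility.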


Properties of the Landau symbols 


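i) It is clear from the definitions that the symbols $\asymp$, $\sim$, $o$ are particular instances of $O$, in the sense that
$$f \asymp g \Rightarrow f = O(g), \qquad f \sim g \Rightarrow f = O(g), \qquad f = o(g) \Rightarrow f = O(g)$$
for $x \to c$. Moreover the symbol $\sim$ is a subcase of $\asymp$:
$$f \sim g \Rightarrow f \asymp g.$$
Observe that if $f \asymp g$, then (5.1) implies
$$\lim_{x\to c}\frac{f(x)}{\ell\,g(x)} = 1, \quad\text{hence}\quad f \sim \ell g.$$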
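ii) The following property is useful:
$$f \sim g \iff f = g+o(g). \tag{5.2}$$
By defining $h(x) = f(x)-g(x)$ in fact, so that $f(x) = g(x)+h(x)$, we have
$$f \sim g \iff \lim_{x\to c}\frac{f(x)}{g(x)} = 1 \iff \lim_{x\to c}\left(\frac{f(x)}{g(x)}-1\right) = 0 \iff \lim_{x\to c}\frac{h(x)}{g(x)} = 0 \iff h = o(g).$$
iii) For any constant $\lambda \neq 0$,
$$o(\lambda f) = o(f) \quad\text{and}\quad \lambda\,o(f) = o(f). \tag{5.3}$$
In fact $g = o(\lambda f)$ means that $\lim_{x\to c}\frac{g(x)}{\lambda f(x)} = 0$, otherwise said $\lim_{x\to c}\frac{g(x)}{f(x)} = 0$, or $g = o(f)$. The remaining identity is proved in a similar way. Analogous properties to (5.3) hold for the symbol $O$.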
Note that o(f) and O(f) do not indicate one specific function, rather a precise 
property of any map represented by one of the two symbols. 
iv) Prescribing $f = o(1)$ amounts to asking that $f$ converge to 0 when $x \to c$. Namely
$$\lim_{x\to c} f(x) = \lim_{x\to c}\frac{f(x)}{1} = 0.$$
Similarly $f = O(1)$ means $f$ converges to a finite limit for $x$ tending to $c$. More generally (compare Remark 5.2), $f = O(1)$ means that $f$ is bounded in a neighbourhood of $c$: that is to say, there exists a constant $C > 0$ such that
$$|f(x)| \le C, \qquad \forall x \in I,\ x \neq c,$$
$I$ being a suitable neighbourhood of $c$.
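v) The continuity of a function $f$ at a point $x_0$ can be expressed by means of the symbol $o$ in the equivalent form
$$f(x) = f(x_0)+o(1), \qquad x \to x_0. \tag{5.4}$$
Recalling (3.9) in fact, we have
$$\lim_{x\to x_0} f(x) = f(x_0) \iff \lim_{x\to x_0}\big(f(x)-f(x_0)\big) = 0 \iff f(x)-f(x_0) = o(1), \quad x \to x_0.$$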


The algebra of “little o’s” 


i) Let us compare the behaviour of the monomials $x^n$ as $x \to 0$:
$$x^n = o(x^m), \quad x \to 0 \iff n > m.$$
In fact
$$\lim_{x\to 0}\frac{x^n}{x^m} = \lim_{x\to 0} x^{n-m} = 0 \quad\text{if and only if}\quad n-m > 0.$$
Therefore when $x \to 0$, the bigger of two powers of $x$ is negligible.

ii) Now consider the limit when $x \to \pm\infty$. Proceeding as before we obtain
$$x^n = o(x^m), \quad x \to \pm\infty \iff n < m.$$
So, for $x \to \pm\infty$, the lesser power of $x$ is negligible.


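iii) The symbols of Landau allow us to simplify algebraic formulas considerably when studying limits. Consider for example the limit for $x \to 0$. The following properties, which define a special "algebra of little o's", hold. Their proof is left to the reader as an exercise:

a) $o(x^n) \pm o(x^n) = o(x^n)$;
b) $o(x^n) \pm o(x^m) = o(x^p)$, with $p = \min(n, m)$;
c) $o(\lambda x^n) = o(x^n)$, for each $\lambda \in \mathbb{R}\setminus\{0\}$;
d) $\varphi(x)\,o(x^n) = o(x^n)$ if $\varphi$ is bounded in a neighbourhood of $x = 0$;
e) $x^m\,o(x^n) = o(x^{m+n})$;
f) $o(x^m)\,o(x^n) = o(x^{m+n})$;
g) $[o(x^n)]^k = o(x^{kn})$. (5.5)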


Fundamental limits 


The fundamental limits in the Table of p. 106 can be reformulated using the symbols of Landau:

$\sin x \sim x$, $x \to 0$;
$1-\cos x \asymp x^2$, $x \to 0$; precisely, $1-\cos x \sim \frac{1}{2}x^2$, $x \to 0$;
$\log(1+x) \sim x$, $x \to 0$; equivalently, $\log x \sim x-1$, $x \to 1$;
$e^x-1 \sim x$, $x \to 0$;
$(1+x)^\alpha-1 \sim \alpha x$, $x \to 0$.

With (5.2), and taking property (5.5) c) into account, these relations read:

$\sin x = x+o(x)$, $x \to 0$;
$1-\cos x = \frac{1}{2}x^2+o(x^2)$, $x \to 0$, or $\cos x = 1-\frac{1}{2}x^2+o(x^2)$, $x \to 0$;
$\log(1+x) = x+o(x)$, $x \to 0$, or $\log x = x-1+o(x-1)$, $x \to 1$;
$e^x = 1+x+o(x)$, $x \to 0$;
$(1+x)^\alpha = 1+\alpha x+o(x)$, $x \to 0$.


Besides, we shall prove in Sect. 6.11 that: 


(5.6) 


Examples 5.4 
i) From $e^t = 1+t+o(t)$, $t \to 0$, by setting $t = 5x$ we have $e^{5x} = 1+5x+o(5x)$, i.e., $e^{5x} = 1+5x+o(x)$, $x \to 0$. In other words $e^{5x}-1 \sim 5x$, $x \to 0$.

ii) Setting $t = -3x^2$ in $(1+t)^{1/2} = 1+\frac{1}{2}t+o(t)$, $t \to 0$, we obtain $(1-3x^2)^{1/2} = 1-\frac{3}{2}x^2+o(-3x^2) = 1-\frac{3}{2}x^2+o(x^2)$, $x \to 0$. Thus $(1-3x^2)^{1/2}-1 \sim -\frac{3}{2}x^2$, $x \to 0$.

iii) The relation $\sin t = t+o(t)$, $t \to 0$, implies, by putting $t = 2x$, $x\sin 2x = x\big(2x+o(2x)\big) = 2x^2+o(x^2)$, $x \to 0$. Then $x\sin 2x \sim 2x^2$, $x \to 0$.
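As a rough numerical companion to Examples 5.4, one may tabulate the quotient of each function by its claimed equivalent; equivalence for x → 0 means the quotient tends to 1. This is a sketch under that reading, with `quotient_table` and the sample points chosen here purely for illustration.

```python
import math

def quotient_table(f, g, xs):
    """Return the values f(x)/g(x) for each x in xs."""
    return [f(x) / g(x) for x in xs]

# Sample points tending to 0 (an arbitrary illustrative choice).
xs = [10.0 ** (-k) for k in range(1, 5)]

# i) e^{5x} - 1 ~ 5x; expm1 evaluates e^t - 1 accurately for small t.
ex1 = quotient_table(lambda x: math.expm1(5 * x), lambda x: 5 * x, xs)

# ii) (1 - 3x^2)^{1/2} - 1 ~ -(3/2) x^2
ex2 = quotient_table(lambda x: math.sqrt(1 - 3 * x * x) - 1,
                     lambda x: -1.5 * x * x, xs)

# iii) x sin 2x ~ 2x^2
ex3 = quotient_table(lambda x: x * math.sin(2 * x), lambda x: 2 * x * x, xs)

for name, values in [("i", ex1), ("ii", ex2), ("iii", ex3)]:
    print(name, values)
```

Each list should drift towards 1 as x shrinks; the speed of that drift reflects the size of the o(·) remainder.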


We explain now how to use the symbols of Landau for calculating limits. All 
maps dealt with below are supposed to be defined, and not to vanish, on a neigh- 
bourhood of c, except possibly at c. 


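Proposition 5.5 Let us consider the limits
$$\lim_{x\to c} f(x)g(x) \quad\text{and}\quad \lim_{x\to c}\frac{f(x)}{g(x)}.$$
Given functions $\tilde f$ and $\tilde g$ such that $\tilde f \sim f$ and $\tilde g \sim g$ for $x \to c$, then
$$\lim_{x\to c} f(x)g(x) = \lim_{x\to c}\tilde f(x)\tilde g(x), \tag{5.7}$$
$$\lim_{x\to c}\frac{f(x)}{g(x)} = \lim_{x\to c}\frac{\tilde f(x)}{\tilde g(x)}. \tag{5.8}$$

Proof. Start with (5.7). Then
$$\lim_{x\to c} f(x)g(x) = \lim_{x\to c}\frac{f(x)}{\tilde f(x)}\,\tilde f(x)\,\frac{g(x)}{\tilde g(x)}\,\tilde g(x) = \lim_{x\to c}\frac{f(x)}{\tilde f(x)}\cdot\lim_{x\to c}\frac{g(x)}{\tilde g(x)}\cdot\lim_{x\to c}\tilde f(x)\tilde g(x).$$
From the definition of $\tilde f \sim f$ and $\tilde g \sim g$ the result follows. The proof of (5.8) is completely analogous.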


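Corollary 5.6 Consider the limits
$$\lim_{x\to c}\big(f(x)+f_1(x)\big)\big(g(x)+g_1(x)\big) \quad\text{and}\quad \lim_{x\to c}\frac{f(x)+f_1(x)}{g(x)+g_1(x)}.$$
If $f_1 = o(f)$ and $g_1 = o(g)$ when $x \to c$, then
$$\lim_{x\to c}\big(f(x)+f_1(x)\big)\big(g(x)+g_1(x)\big) = \lim_{x\to c} f(x)g(x),$$
$$\lim_{x\to c}\frac{f(x)+f_1(x)}{g(x)+g_1(x)} = \lim_{x\to c}\frac{f(x)}{g(x)}.$$

Proof. Set $\tilde f = f+f_1$; by assumption $\tilde f = f+o(f)$, so from (5.2) one has $\tilde f \sim f$. Similarly, putting $\tilde g = g+g_1$ yields $\tilde g \sim g$. The claim follows from the previous Proposition. □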


The meaning of these properties is clear: when computing the limit of a product, 
we may substitute each factor with an equivalent function. Alternatively, one may 
ignore negligible summands with respect to others within one factor. In a similar 
way one can handle the limit of a quotient, numerator and denominator now being 
the ‘factors’. 


Examples 5.7 
i) Compute
$$\lim_{x\to 0}\frac{1-\cos 2x}{\sin^2 3x}.$$
From the equivalence $1-\cos t \sim \frac{1}{2}t^2$, $t \to 0$, the substitution $t = 2x$ gives
$$1-\cos 2x \sim 2x^2, \quad x \to 0.$$
Putting $t = 3x$ in $\sin t \sim t$, $t \to 0$, we obtain $\sin 3x \sim 3x$, $x \to 0$, hence
$$\sin^2 3x \sim 9x^2, \quad x \to 0.$$
Therefore (5.8) implies
$$\lim_{x\to 0}\frac{1-\cos 2x}{\sin^2 3x} = \lim_{x\to 0}\frac{2x^2}{9x^2} = \frac{2}{9}.$$
ii) Evaluate
$$\lim_{x\to 0}\frac{\sin 2x+x^3}{4x+5\log(1+x^2)}.$$
We shall show that for $x \to 0$, $x^3$ is negligible with respect to $\sin 2x$, and similarly $5\log(1+x^2)$ is negligible with respect to $4x$. With that, we can use the previous corollary and conclude
$$\lim_{x\to 0}\frac{\sin 2x+x^3}{4x+5\log(1+x^2)} = \lim_{x\to 0}\frac{\sin 2x}{4x} = \frac{1}{2}.$$
Recall $\sin 2x \sim 2x$ for $x \to 0$; thus
$$\lim_{x\to 0}\frac{x^3}{\sin 2x} = \lim_{x\to 0}\frac{x^3}{2x} = 0,$$
that is to say $x^3 = o(\sin 2x)$ for $x \to 0$. On the other hand, since $\log(1+t) \sim t$ for $t \to 0$, writing $t = x^2$ yields $\log(1+x^2) \sim x^2$ when $x \to 0$. Then
$$\lim_{x\to 0}\frac{5\log(1+x^2)}{4x} = \lim_{x\to 0}\frac{5x^2}{4x} = 0,$$
i.e., $5\log(1+x^2) = o(4x)$ for $x \to 0$. □
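The two limits of Examples 5.7 can be cross-checked by direct evaluation close to 0. The sketch below is a numerical plausibility check only; the function names `first_quotient` and `second_quotient` are introduced here for illustration.

```python
import math

def first_quotient(x):
    """(1 - cos 2x) / sin^2 3x, expected to approach 2/9 as x -> 0."""
    return (1 - math.cos(2 * x)) / math.sin(3 * x) ** 2

def second_quotient(x):
    """(sin 2x + x^3) / (4x + 5 log(1 + x^2)), expected to approach 1/2 as x -> 0."""
    return (math.sin(2 * x) + x ** 3) / (4 * x + 5 * math.log1p(x * x))

for x in (1e-1, 1e-2, 1e-3):
    print(x, first_quotient(x), second_quotient(x))
```

The second quotient approaches 1/2 more slowly than the first approaches 2/9, because the neglected summand 5 log(1 + x²) is only one power of x smaller than 4x.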


These 'simplification' rules hold only in the case of products and quotients. They do not apply to limits of sums or differences of functions. Otherwise put, the fact that $\tilde f \sim f$ and $\tilde g \sim g$ when $x \to c$ does not allow us to conclude that
$$\lim_{x\to c}\,[f(x) \pm g(x)] = \lim_{x\to c}\,[\tilde f(x) \pm \tilde g(x)].$$
For example set $f(x) = \sqrt{x^2+2x}$ and $g(x) = \sqrt{x^2-1}$ and consider the limit
$$\lim_{x\to+\infty}\left(\sqrt{x^2+2x}-\sqrt{x^2-1}\right).$$
Rationalisation turns this limit into
$$\lim_{x\to+\infty}\frac{(x^2+2x)-(x^2-1)}{\sqrt{x^2+2x}+\sqrt{x^2-1}} = \lim_{x\to+\infty}\frac{2x+1}{x\left(\sqrt{1+\frac{2}{x}}+\sqrt{1-\frac{1}{x^2}}\right)} = 1.$$
Had we substituted to $f(x)$ the function $\tilde f(x) = x$, equivalent to $f$ for $x \to +\infty$, we would have obtained a different limit, actually a wrong one. In fact,
$$\lim_{x\to+\infty}\left(x-\sqrt{x^2-1}\right) = \lim_{x\to+\infty}\frac{x^2-(x^2-1)}{x+\sqrt{x^2-1}} = \lim_{x\to+\infty}\frac{1}{x\left(1+\sqrt{1-\frac{1}{x^2}}\right)} = 0.$$
The reason for the mismatch lies in the cancellation of the leading term $x^2$ appearing in the numerator after rationalisation, which renders the terms of lesser degree important for the limit, even though they are negligible with respect to $x^2$ for $x \to +\infty$.
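The failure of the substitution rule for differences shows up clearly in numbers. The following sketch evaluates the original difference and the one obtained after wrongly replacing √(x² + 2x) by the equivalent function x; both function names are illustrative.

```python
import math

def exact_difference(x):
    """sqrt(x^2 + 2x) - sqrt(x^2 - 1), whose limit at +infinity is 1."""
    return math.sqrt(x * x + 2 * x) - math.sqrt(x * x - 1)

def after_bad_substitution(x):
    """x - sqrt(x^2 - 1): the first summand replaced by the equivalent map x."""
    return x - math.sqrt(x * x - 1)

for x in (1e1, 1e2, 1e3, 1e4):
    print(x, exact_difference(x), after_bad_substitution(x))
```

The first column of values stays near 1 while the second decays like 1/(2x), even though the two first summands are equivalent for x → +∞.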


5.2 Infinitesimal and infinite functions 


Definition 5.8 Let $f$ be a function defined in a neighbourhood of $c$, except possibly at $c$. Then $f$ is said to be infinitesimal (or an infinitesimal) at $c$ if
$$\lim_{x\to c} f(x) = 0,$$
i.e., if $f = o(1)$ for $x \to c$. The function $f$ is said to be infinite at $c$ if
$$\lim_{x\to c} f(x) = \infty.$$

Let us introduce the following terminology to compare two infinitesimal or infinite maps.

Definition 5.9 Let $f$, $g$ be two infinitesimals at $c$.
If $f \asymp g$ for $x \to c$, $f$ and $g$ are said to be infinitesimals of the same order.
If $f = o(g)$ for $x \to c$, $f$ is called infinitesimal of bigger order than $g$.
If $g = o(f)$ for $x \to c$, $f$ is called infinitesimal of smaller order than $g$.
If none of the above is satisfied, $f$ and $g$ are said to be non-comparable infinitesimals.

Definition 5.10 Let $f$ and $g$ be two infinite maps at $c$.
If $f \asymp g$ for $x \to c$, $f$ and $g$ are said to be infinite of the same order.
If $f = o(g)$ for $x \to c$, $f$ is called infinite of smaller order than $g$.
If $g = o(f)$ for $x \to c$, $f$ is called infinite of bigger order than $g$.
If none of the above is satisfied, the infinite functions $f$ and $g$ are said to be non-comparable.


Examples 5.11 


Bearing in mind the fundamental limits seen above, it is immediate to verify the 
following facts: 
i) $e^x-1$ is an infinitesimal of the same order as $x$ at the origin.

ii) $\sin x^2$ is an infinitesimal of bigger order than $x$ at the origin.

iii) $\dfrac{\sin x}{(1-\cos x)^2}$ is infinite of bigger order than $\dfrac{1}{x}$ at the origin.

iv) For every $\alpha > 0$, $e^x$ is infinite of bigger order than $x^\alpha$ for $x \to +\infty$.

v) For every $\alpha > 0$, $\log x$ is infinite of smaller order than $\dfrac{1}{x^\alpha}$ for $x \to 0^+$.

vi) The functions $f(x) = x\sin\frac{1}{x}$ and $g(x) = x$ are infinitesimal for $x$ tending to 0 (for $f$ recall Corollary 4.7). But the quotient $\dfrac{f(x)}{g(x)} = \sin\frac{1}{x}$ does not admit a limit for $x \to 0$, for in any neighbourhood of 0 it attains every value between $-1$ and $1$ infinitely many times. Therefore none of the conditions $f \asymp g$, $f = o(g)$, $g = o(f)$ holds for $x \to 0$. The two functions $f$ and $g$ are thus not comparable. □


Using a non-rigorous yet colourful language, we shall express the fact that $f$ is infinitesimal (or infinite) of bigger order than $g$ by saying that $f$ tends to 0 (or $\infty$) faster than $g$. This suggests measuring the speed at which an infinitesimal (or infinite) map converges to its limit value.

For that purpose, let us fix an infinitesimal (or infinite) map $\varphi$ defined in a neighbourhood of $c$ and particularly easy to compute. We shall use it as term of comparison ('test function') and in fact call it an infinitesimal test function (or infinite test function) at $c$. When the limit behaviour is clear, we refer to $\varphi$ as test function for brevity. The most common test functions (certainly not the only ones) are the following. If $c = x_0 \in \mathbb{R}$, we choose
$$\varphi(x) = x-x_0 \quad\text{or}\quad \varphi(x) = |x-x_0|$$
as infinitesimal test functions (the latter in case we need to consider non-integer powers of $\varphi$, see later), and
$$\varphi(x) = \frac{1}{x-x_0} \quad\text{or}\quad \varphi(x) = \frac{1}{|x-x_0|}$$
as infinite test functions. For $c = x_0^+$ ($c = x_0^-$), we will choose as infinitesimal test function
$$\varphi(x) = x-x_0 \qquad (\varphi(x) = x_0-x)$$
and as infinite test function
$$\varphi(x) = \frac{1}{x-x_0} \qquad \left(\varphi(x) = \frac{1}{x_0-x}\right).$$
For $c = +\infty$, the infinitesimal and infinite test functions will respectively be
$$\varphi(x) = \frac{1}{x} \quad\text{and}\quad \varphi(x) = x,$$
while for $c = -\infty$, we shall take
$$\varphi(x) = \frac{1}{|x|} \quad\text{and}\quad \varphi(x) = |x|.$$

The definition of ‘speed of convergence’ of an infinitesimal or infinite f depends 
on how f compares to the powers of the infinitesimal or infinite test function. To 
be precise, we have the following definition 


Definition 5.12 Let $f$ be infinitesimal (or infinite) at $c$. If there exists a real number $\alpha > 0$ such that
$$f \asymp \varphi^\alpha, \qquad x \to c, \tag{5.11}$$
the constant $\alpha$ is called the order of $f$ at $c$ with respect to the infinitesimal (infinite) test function $\varphi$.

Notice that if condition (5.11) holds, it determines the order uniquely. In the first case in fact, it is immediate to see that for any $\beta < \alpha$ one has $f = o(\varphi^\beta)$, while $\beta > \alpha$ implies $\varphi^\beta = o(f)$. A similar argument holds for infinite maps.

If $f$ has order $\alpha$ at $c$ with respect to the test function $\varphi$, then there is a real number $\ell \neq 0$ such that
$$\lim_{x\to c}\frac{f(x)}{\varphi^\alpha(x)} = \ell.$$
Rephrasing:
$$f \sim \ell\varphi^\alpha, \qquad x \to c,$$
which is to say, recalling (5.2), $f = \ell\varphi^\alpha+o(\ell\varphi^\alpha)$ for $x \to c$. For the sake of simplicity we can omit the constant $\ell$ in the symbol $o$, because if a function $h$ satisfies $h = o(\ell\varphi^\alpha)$, then $h = o(\varphi^\alpha)$ as well. Therefore
$$f = \ell\varphi^\alpha+o(\varphi^\alpha), \qquad x \to c.$$

Definition 5.13 The function
$$p(x) = \ell\varphi^\alpha(x) \tag{5.12}$$
is called the principal part of the infinitesimal (infinite) map $f$ at $c$ with respect to the infinitesimal (infinite) test function $\varphi$.


From the qualitative point of view the behaviour of the function $f$ in a small enough neighbourhood of $c$ coincides with the behaviour of its principal part (in geometrical terms, the two graphs resemble each other). With a suitable choice of test function $\varphi$, like one of those mentioned above, the behaviour of the function $\ell\varphi^\alpha(x)$ becomes immediately clear. So if one is able to determine the principal part of a function, even a complicated one, at a given point $c$, the local behaviour around that point is easily described.

We wish to stress that to find the order and the principal part of a function $f$ at $c$, one must start from the limit
$$\lim_{x\to c}\frac{f(x)}{\varphi^\alpha(x)}$$
and understand if there is a number $\alpha$ for which such limit, say $\ell$, is finite and different from zero. If so, $\alpha$ is the required order, and the principal part of $f$ is given by (5.12).


Examples 5.14 


i) The function $f(x) = \sin x-\tan x$ is infinitesimal for $x \to 0$. Using the basic equivalences of p. 127 and Proposition 5.5, we can write
$$f(x) = \frac{\sin x\,(\cos x-1)}{\cos x} \sim \frac{x\cdot\left(-\frac{1}{2}x^2\right)}{1} = -\frac{1}{2}x^3, \qquad x \to 0.$$
It follows that $f(x)$ is infinitesimal of order 3 at the origin with respect to the test function $\varphi(x) = x$; its principal part is $p(x) = -\frac{1}{2}x^3$.

ii) The function
$$f(x) = \sqrt{x^2+3}-\sqrt{x^2-1}$$
is infinitesimal for $x \to +\infty$. Rationalising the expression we get
$$f(x) = \frac{(x^2+3)-(x^2-1)}{\sqrt{x^2+3}+\sqrt{x^2-1}} = \frac{4}{x\left(\sqrt{1+\frac{3}{x^2}}+\sqrt{1-\frac{1}{x^2}}\right)}.$$
The right-hand side shows that if one chooses $\varphi(x) = \frac{1}{x}$ then
$$\lim_{x\to+\infty}\frac{f(x)}{\varphi(x)} = 2.$$
Therefore $f$ is infinitesimal of first order for $x \to +\infty$ with respect to the test function $\frac{1}{x}$, with principal part $p(x) = \frac{2}{x}$.

iii) The function
$$f(x) = \sqrt{9x^5+7x^3-1}$$
is infinite when $x \to +\infty$. To determine its order with respect to $\varphi(x) = x$, we consider the limit
$$\lim_{x\to+\infty}\frac{f(x)}{x^\alpha} = \lim_{x\to+\infty}\frac{x^{5/2}\sqrt{9+\frac{7}{x^2}-\frac{1}{x^5}}}{x^\alpha}.$$
By choosing $\alpha = \frac{5}{2}$ the limit becomes 3. So $f$ has order $\frac{5}{2}$ for $x \to +\infty$ with respect to the test function $\varphi(x) = x$. The principal part is $p(x) = 3x^{5/2}$.
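One way to guess the order α and the constant ℓ of Examples 5.14 before proving them is to measure a log-log slope between two sample points and then evaluate f/φ^α at one of them. The sketch below follows that heuristic; the helper names, the sample points and the slope formula are assumptions made for illustration and are not part of the text's method.

```python
import math

def estimate_order(f, phi, x1, x2):
    """Heuristic order: slope of log|f| against log|phi| between x1 and x2."""
    return math.log(abs(f(x2) / f(x1))) / math.log(abs(phi(x2) / phi(x1)))

def estimate_coefficient(f, phi, alpha, x):
    """Approximate the limit of f / phi^alpha by evaluating the quotient at x."""
    return f(x) / phi(x) ** alpha

# i) f(x) = sin x - tan x at 0, test function phi(x) = x: expect order 3, coefficient -1/2.
f1 = lambda x: math.sin(x) - math.tan(x)
order1 = estimate_order(f1, lambda x: x, 1e-2, 1e-3)
coeff1 = estimate_coefficient(f1, lambda x: x, 3, 1e-3)

# ii) f(x) = sqrt(x^2+3) - sqrt(x^2-1) at +infinity, phi(x) = 1/x: expect order 1, coefficient 2.
f2 = lambda x: math.sqrt(x * x + 3) - math.sqrt(x * x - 1)
order2 = estimate_order(f2, lambda x: 1 / x, 1e3, 1e4)
coeff2 = estimate_coefficient(f2, lambda x: 1 / x, 1, 1e4)

# iii) f(x) = sqrt(9x^5 + 7x^3 - 1) at +infinity, phi(x) = x: expect order 5/2, coefficient 3.
f3 = lambda x: math.sqrt(9 * x ** 5 + 7 * x ** 3 - 1)
order3 = estimate_order(f3, lambda x: x, 1e3, 1e4)
coeff3 = estimate_coefficient(f3, lambda x: x, 2.5, 1e4)

print(order1, coeff1, order2, coeff2, order3, coeff3)
```

The slope estimate is meaningful only when an order exists; for functions such as those of Remark 5.15 it keeps drifting as the sample points move towards c.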


Remark 5.15 The previous are typical instances of how to determine the order of a function with respect to some test map. The reader should not be misled into believing that this is always possible. Given an infinitesimal or an infinite $f$ at $c$, and having chosen a corresponding test map $\varphi$, it may well happen that there is no real number $\alpha > 0$ satisfying $f \asymp \varphi^\alpha$ for $x \to c$. In such a case it is convenient to make a different choice of test function, one more suitable to describe the behaviour of $f$ around $c$. We shall clarify this fact with two examples.

Start by taking the function $f(x) = e^{2x}$ for $x \to +\infty$. Using (5.6) a), it follows immediately that $x^\alpha = o(e^{2x})$, whichever $\alpha > 0$ is considered. So it is not possible to determine an order for $f$ with respect to $\varphi(x) = x$: the exponential map grows too quickly for any polynomial function to keep up with it. But if we take as test function $\varphi(x) = e^x$ then clearly $f$ has order 2 with respect to $\varphi$.

Consider now $f(x) = x\log x$ for $x \to 0^+$. In (5.6) d) we claimed that
$$\lim_{x\to 0^+}\frac{\log x}{\frac{1}{x^\beta}} = 0 \qquad \forall\beta > 0.$$
So in particular $f(x) = \dfrac{\log x}{1/x}$ is infinitesimal when $x \to 0^+$. Using the test function $\varphi(x) = x$ one sees that
$$\lim_{x\to 0^+}\frac{x\log x}{x^\alpha} = \lim_{x\to 0^+}\frac{\log x}{x^{\alpha-1}} = \begin{cases} 0 & \text{if } \alpha < 1, \\ -\infty & \text{otherwise.} \end{cases}$$
Definition 5.9 yields that $f$ is an infinitesimal of bigger order than any power of $x$ with exponent less than one. At the same time it has smaller order than $x$ and all powers with exponent greater than one. In this case too, it is not possible to determine the order of $f$ with respect to $x$. The function $|f(x)| = x|\log x|$ goes to zero more slowly than $x$, yet faster than $x^\alpha$ for any $\alpha < 1$. Thus it can be used as alternative infinitesimal test map when $x \to 0^+$.




5.3 Asymptotes 


We now consider a function $f$ defined in a neighbourhood of $+\infty$ and wish to study its behaviour for $x \to +\infty$. A remarkable case is that in which $f$ behaves as a polynomial of first degree. Geometrically speaking, this corresponds to the fact that the graph of $f$ will look more and more like a straight line. Precisely, we suppose there exist two real numbers $m$ and $q$ such that
$$\lim_{x\to+\infty}\big(f(x)-(mx+q)\big) = 0, \tag{5.13}$$
or, using the symbols of Landau,
$$f(x) = mx+q+o(1), \qquad x \to +\infty.$$
We then say that the line $g(x) = mx+q$ is a right asymptote of the function $f$. The asymptote is called oblique if $m \neq 0$, horizontal if $m = 0$. In geometrical terms condition (5.13) tells that the vertical distance $d(x) = |f(x)-g(x)|$ between the graph of $f$ and the asymptote tends to 0 as $x \to +\infty$ (Fig. 5.1).

The asymptote's coefficients can be recovered using limits:
$$m = \lim_{x\to+\infty}\frac{f(x)}{x} \quad\text{and}\quad q = \lim_{x\to+\infty}\big(f(x)-mx\big). \tag{5.14}$$
The first relation comes from (5.13) noting that
$$0 = \lim_{x\to+\infty}\frac{f(x)-mx-q}{x} = \lim_{x\to+\infty}\frac{f(x)}{x}-\lim_{x\to+\infty}\frac{mx}{x}-\lim_{x\to+\infty}\frac{q}{x} = \lim_{x\to+\infty}\frac{f(x)}{x}-m,$$
while the second one follows directly from (5.13). The conditions (5.14) furnish the means to find the possible asymptote of a function $f$. If in fact both limits exist and are finite, $f$ admits $y = mx+q$ as a right asymptote. If even one of the limits in (5.14) fails to be finite, then $f$ will not have an asymptote.

Figure 5.1. Graph of a function with its right asymptote

Notice that if $f$ has an oblique asymptote, i.e., if $m \neq 0$, the first of (5.14) tells us that $f$ is infinite of order 1 with respect to the test function $\varphi(x) = x$ for $x \to +\infty$. The reader should beware that not all functions satisfying the latter condition admit an oblique asymptote: the function $f(x) = x+\sqrt{x}$ for example is equivalent to $x$ for $x \to +\infty$, but has no asymptote since the second limit in (5.14) is $+\infty$.
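The two quotients in (5.14) can be tabulated to see whether an asymptote is plausible. In this sketch the slope m is supplied by hand (as found analytically), and `asymptote_quotients` is a hypothetical helper name; the tables only hint at the limits and do not replace their computation.

```python
import math

def asymptote_quotients(f, m, xs):
    """Tabulate f(x)/x and f(x) - m*x, the two expressions appearing in (5.14)."""
    slopes = [f(x) / x for x in xs]
    intercepts = [f(x) - m * x for x in xs]
    return slopes, intercepts

# Sample points growing towards +infinity (an arbitrary illustrative choice).
xs = [10.0 ** k for k in range(2, 7)]

# f(x) = sqrt(1 + x^2): both limits exist (m = 1, q = 0), so y = x is a right asymptote.
slopes_a, intercepts_a = asymptote_quotients(lambda x: math.sqrt(1 + x * x), 1.0, xs)

# f(x) = x + sqrt(x): f(x)/x tends to 1, but f(x) - x = sqrt(x) is unbounded.
slopes_b, intercepts_b = asymptote_quotients(lambda x: x + math.sqrt(x), 1.0, xs)

print(slopes_a, intercepts_a)
print(slopes_b, intercepts_b)
```

In the second case the first table settles near 1 while the second keeps growing, which matches the remark that equivalence to x does not guarantee an oblique asymptote.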


Remark 5.16 The definition of (linear) asymptote given above is a particular instance of the following. The function $f$ is called asymptotic to a function $g$ for $x \to +\infty$ if
$$\lim_{x\to+\infty}\big(f(x)-g(x)\big) = 0.$$
If (5.13) holds one can then say that $f$ is asymptotic to the line $g(x) = mx+q$. The function $f(x) = x^2+\frac{1}{x}$ instead has no line as asymptote for $x \to +\infty$, but is nevertheless asymptotic to the parabola $g(x) = x^2$. □

In a similar fashion one defines oblique or horizontal asymptotes for $x \to -\infty$ (that is, oblique or horizontal left asymptotes).

If the line $y = mx+q$ is an oblique or horizontal asymptote both for $x \to +\infty$ and $x \to -\infty$, we shall say that it is a complete oblique or complete horizontal asymptote for $f$.

Finally, if at a point $x_0 \in \mathbb{R}$ one has $\lim_{x\to x_0} f(x) = \infty$, the line $x = x_0$ is called a vertical asymptote for $f$ at $x_0$. The distance between points on the graph of $f$ and on a vertical asymptote with the same $y$-coordinate converges to zero for $x \to x_0$. If the limit condition holds only for $x \to x_0^+$ or $x \to x_0^-$ we talk about a vertical right or left asymptote respectively.


Examples 5.17 
i) Let $f(x) = \dfrac{x}{x+1}$. As
$$\lim_{x\to\pm\infty} f(x) = 1 \quad\text{and}\quad \lim_{x\to-1^{\pm}} f(x) = \mp\infty,$$
the function has a horizontal asymptote $y = 1$ and a vertical asymptote $x = -1$.

ii) The map $f(x) = \sqrt{1+x^2}$ satisfies
$$\lim_{x\to\pm\infty} f(x) = +\infty, \qquad \lim_{x\to\pm\infty}\frac{f(x)}{x} = \lim_{x\to\pm\infty}\frac{|x|\sqrt{1+x^{-2}}}{x} = \pm 1$$
and
$$\lim_{x\to+\infty}\left(\sqrt{1+x^2}-x\right) = \lim_{x\to+\infty}\frac{1+x^2-x^2}{\sqrt{1+x^2}+x} = 0,$$
$$\lim_{x\to-\infty}\left(\sqrt{1+x^2}+x\right) = \lim_{x\to-\infty}\frac{1+x^2-x^2}{\sqrt{1+x^2}-x} = 0.$$
Therefore $f$ has an oblique asymptote for $x \to +\infty$ given by $y = x$, plus another one of equation $y = -x$ for $x \to -\infty$.

iii) Let $f(x) = x+\log x$. Since
$$\lim_{x\to 0^+}(x+\log x) = -\infty, \qquad \lim_{x\to+\infty}(x+\log x) = +\infty,$$
$$\lim_{x\to+\infty}\frac{x+\log x}{x} = 1, \qquad \lim_{x\to+\infty}(x+\log x-x) = +\infty,$$
the function has a vertical right asymptote $x = 0$ but neither horizontal nor oblique asymptotes.


5.4 Further properties of sequences 


We return to the study of the limit behaviour of sequences begun in Sect. 3.2. 
General theorems concerning functions apply to sequences as well (the latter being 
particular functions defined over the integers, after all). For the sake of complete- 
ness those results will be recalled, and adapted to the case of concern. We shall 
also state and prove other specific properties of sequences. 

We say that a sequence $\{a_n\}_{n\ge n_0}$ satisfies a given property eventually, if there exists an integer $N \ge n_0$ such that the sequence $\{a_n\}_{n\ge N}$ satisfies that property. This definition allows for a more flexible study of sequences.


Theorems on sequences 


1. Uniqueness of the limit: the limit of a sequence, when defined, is unique.

2. Boundedness: a converging sequence is bounded.

3. Existence of limit for monotone sequences: if an eventually monotone sequence is bounded, then it converges; if not bounded then it diverges (to $+\infty$ if increasing, to $-\infty$ if decreasing).

4. First comparison theorem: let $\{a_n\}$ and $\{b_n\}$ be sequences with finite or infinite limits $\lim_{n\to\infty} a_n = \ell$ and $\lim_{n\to\infty} b_n = m$. If $a_n \le b_n$ eventually, then $\ell \le m$.

5. Second comparison theorem ("Squeeze rule"): let $\{a_n\}$, $\{b_n\}$ and $\{c_n\}$ be sequences with $\lim_{n\to\infty} a_n = \lim_{n\to\infty} c_n = \ell$. If $a_n \le b_n \le c_n$ eventually, then $\lim_{n\to\infty} b_n = \ell$.

6. Theorem: a sequence $\{a_n\}$ is infinitesimal, that is $\lim_{n\to\infty} a_n = 0$, if and only if the sequence $\{|a_n|\}$ is infinitesimal.

7. Theorem: let $\{a_n\}$ be an infinitesimal sequence and $\{b_n\}$ a bounded one. Then the sequence $\{a_n b_n\}$ is infinitesimal.

8. Algebra of limits: let $\{a_n\}$ and $\{b_n\}$ be such that $\lim_{n\to\infty} a_n = \ell$ and $\lim_{n\to\infty} b_n = m$ ($\ell$, $m$ finite or infinite). Then
$$\lim_{n\to\infty}(a_n \pm b_n) = \ell \pm m, \qquad \lim_{n\to\infty} a_n b_n = \ell\,m, \qquad \lim_{n\to\infty}\frac{a_n}{b_n} = \frac{\ell}{m} \ \text{ if } b_n \neq 0 \text{ eventually},$$
each time the right-hand sides are defined according to the Table on p. 96.

9. Substitution theorem: let $\{a_n\}$ be a sequence with $\lim_{n\to\infty} a_n = \ell$ and suppose $g$ is a function defined in a neighbourhood of $\ell$:
a) if $\ell \in \mathbb{R}$ and $g$ is continuous at $\ell$, then $\lim_{n\to\infty} g(a_n) = g(\ell)$;
b) if $\ell \notin \mathbb{R}$ and $\lim_{x\to\ell} g(x) = m$ exists, then $\lim_{n\to\infty} g(a_n) = m$.


Proof. We shall only prove Theorem 2 since the others are derived by adapting the similar proofs given for functions.
Let the sequence $\{a_n\}_{n\ge n_0}$ be given, and suppose it converges to $\ell \in \mathbb{R}$. With $\varepsilon = 1$ fixed, there exists an integer $n_1 \ge n_0$ so that $|a_n-\ell| < 1$ for all $n > n_1$. For such $n$'s then the triangle inequality (1.1) yields
$$|a_n| = |a_n-\ell+\ell| \le |a_n-\ell|+|\ell| < 1+|\ell|.$$
By putting $M = \max\{|a_{n_0}|, \ldots, |a_{n_1}|, 1+|\ell|\}$ one obtains $|a_n| \le M$, $\forall n \ge n_0$.


Examples 5.18 


i) Consider the sequence $a_n = q^n$, where $q$ is a fixed number in $\mathbb{R}$. It goes under the name of geometric sequence. We claim that
$$\lim_{n\to\infty} q^n = \begin{cases} 0 & \text{if } |q| < 1, \\ 1 & \text{if } q = 1, \\ +\infty & \text{if } q > 1, \\ \text{does not exist} & \text{if } q \le -1. \end{cases}$$
If either $q = 0$ or $q = 1$, the sequence is constant and thus trivially convergent to 0 or 1 respectively. When $q = -1$ the sequence is indeterminate.

Let $q > 1$: the sequence is now strictly increasing and so admits a limit. In order to show that the limit is indeed $+\infty$ we write $q = 1+r$ with $r > 0$ and apply the binomial formula (1.13):
$$q^n = (1+r)^n = \sum_{k=0}^{n}\binom{n}{k} r^k = 1+nr+\sum_{k=2}^{n}\binom{n}{k} r^k.$$
As all terms in the last summation are positive, we obtain
$$(1+r)^n \ge 1+nr, \qquad \forall n \ge 0, \tag{5.15}$$
called Bernoulli inequality¹. Therefore $q^n \ge 1+nr$; passing to the limit for $n \to \infty$ and using the First comparison theorem we can conclude.

Let us examine the case $|q| < 1$ with $q \neq 0$. We just saw that $\frac{1}{|q|} > 1$ implies $\lim_{n\to\infty}\left(\frac{1}{|q|}\right)^n = +\infty$. The sequence $\{|q|^n\}$ is thus infinitesimal, and so is $\{q^n\}$.

At last, take $q < -1$. Since
$$\lim_{k\to\infty} q^{2k} = \lim_{k\to\infty}(q^2)^k = +\infty, \qquad \lim_{k\to\infty} q^{2k+1} = q\lim_{k\to\infty} q^{2k} = -\infty,$$
the sequence $q^n$ is indeterminate.

ii) Let $p$ be a fixed positive number and consider the sequence $\sqrt[n]{p}$. Applying the Substitution theorem with $g(x) = p^x$ we have
$$\lim_{n\to\infty}\sqrt[n]{p} = \lim_{n\to\infty} p^{1/n} = p^0 = 1.$$

iii) Consider the sequence $\sqrt[n]{n}$; using once again the Substitution theorem together with (5.6) c), it follows that
$$\lim_{n\to\infty}\sqrt[n]{n} = \lim_{n\to\infty}\exp\frac{\log n}{n} = e^0 = 1.$$

There are easy criteria to decide whether a sequence is infinitesimal or infinite. 
Among them, the following is the most widely employed. 


Theorem 5.19 (Ratio test) Let $\{a_n\}$ be a sequence for which $a_n > 0$ eventually. Suppose the limit
$$\lim_{n\to\infty}\frac{a_{n+1}}{a_n} = q$$
exists, finite or infinite. If $q < 1$ then $\lim_{n\to\infty} a_n = 0$; if $q > 1$ then $\lim_{n\to\infty} a_n = +\infty$.

¹ By the Principle of Induction, one can prove that (5.15) actually holds for any $r \ge -1$; see Appendix A.1, p. 427.

Proof. Suppose $a_n > 0$, $\forall n \ge n_0$. Take $q < 1$ and set $\varepsilon = 1-q$. By definition of limit there exists an integer $n_\varepsilon \ge n_0$ such that for all $n > n_\varepsilon$
$$\frac{a_{n+1}}{a_n} < q+\varepsilon = 1, \quad\text{i.e.,}\quad a_{n+1} < a_n.$$
So the sequence $\{a_n\}$ is monotone decreasing eventually, and as such it admits a finite non-negative limit $\ell$. Now if $\ell$ were different from zero, the fact that
$$q = \lim_{n\to\infty}\frac{a_{n+1}}{a_n} = \frac{\ell}{\ell} = 1$$
would contradict the assumption $q < 1$.

If $q > 1$, it is enough to consider the sequence $\{1/a_n\}$. □

Nothing can be said if $q = 1$.


Remark 5.20 The previous theorem has another proof, which emphasizes the speed at which a sequence converges to 0 or $+\infty$. Take for example the case $q < 1$. The definition of limit tells that for all $r$ with $q < r < 1$, if one puts $\varepsilon = r-q$ there is an $n_\varepsilon \ge n_0$ such that
$$\frac{a_{n+1}}{a_n} < r, \quad\text{that is,}\quad a_{n+1} < r\,a_n$$
for each $n > n_\varepsilon$. Repeating the argument leads to
$$a_{n+1} < r\,a_n < r^2 a_{n-1} < \ldots < r^{\,n-n_\varepsilon} a_{n_\varepsilon+1} \tag{5.16}$$
(a precise proof of which requires the Principle of Induction; see Appendix A.1, p. 430). The First comparison test and the limit behaviour of the geometric sequence (Example 5.18 i)) allow us to conclude. Formula (5.16) shows that the smaller $q$ is, the faster the sequence $\{a_n\}$ goes to 0.

Similar considerations hold when $q > 1$. □
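To make the Ratio test and the bound (5.16) concrete, the sketch below uses the sample sequence a_n = n²/2ⁿ, which is an assumption chosen for illustration and does not appear in the text. Its ratios tend to q = 1/2; taking r = 0.75, the ratios stay below r from n = 5 on, so n_ε = 5 is an admissible index.

```python
def a(n):
    """Illustrative sequence a_n = n^2 / 2^n (not taken from the text)."""
    return n ** 2 / 2 ** n

def ratio_test_data(seq, n_max):
    """Return the terms seq(1..n_max+1) and the consecutive ratios seq(n+1)/seq(n)."""
    terms = [seq(n) for n in range(1, n_max + 2)]
    ratios = [terms[i + 1] / terms[i] for i in range(n_max)]
    return terms, ratios

terms, ratios = ratio_test_data(a, 200)

# Geometric bound in the spirit of (5.16): a_{n+1} < r^(n - n_eps) * a_{n_eps + 1} for n > n_eps.
r, n_eps = 0.75, 5
bound_holds = all(a(n + 1) < r ** (n - n_eps) * a(n_eps + 1)
                  for n in range(n_eps + 1, 100))

print(ratios[:5], ratios[-1])
print(terms[-1], bound_holds)
```

The comparison with the geometric sequence rⁿ is what forces the terms to 0; a smaller admissible r gives a faster guaranteed decay.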


Finally we consider a few significant sequences converging to $+\infty$. We compare their limit behaviour using Definition 5.10. To be precise we examine the sequences

\[ \log n, \quad n^\alpha, \quad q^n, \quad n!, \quad n^n \qquad (\alpha > 0,\ q > 1) \]

and show that each sequence is infinite of order bigger than the one preceding it. Comparing the first two is immediate, for the Substitution theorem and (5.6) c) yield $\log n = o(n^\alpha)$ for $n \to \infty$.

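The remaining cases are tackled by applying the Ratio test 5.19 to the quotient of two nearby sequences. Precisely, let us set $a_n = \dfrac{n^\alpha}{q^n}$. Then

\[ \frac{a_{n+1}}{a_n} = \frac{(n+1)^\alpha}{q^{n+1}} \cdot \frac{q^n}{n^\alpha} = \frac{1}{q}\left(\frac{n+1}{n}\right)^{\alpha} \to \frac{1}{q} < 1, \qquad n \to \infty. \]

Thus $\lim_{n\to\infty} a_n = 0$, or $n^\alpha = o(q^n)$ for $n \to \infty$.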




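Now take $a_n = \dfrac{q^n}{n!}$, so

\[ \frac{a_{n+1}}{a_n} = \frac{q^{n+1}}{(n+1)!} \cdot \frac{n!}{q^n} = \frac{q\,n!}{(n+1)\,n!} = \frac{q}{n+1} \to 0 < 1, \qquad n \to \infty, \]

and then $q^n = o(n!)$ for $n \to \infty$.

Finally, let $a_n = \dfrac{n!}{n^n}$. Then

\[ \frac{a_{n+1}}{a_n} = \frac{(n+1)!}{(n+1)^{n+1}} \cdot \frac{n^n}{n!} = \frac{(n+1)\,n!}{(n+1)(n+1)^n} \cdot \frac{n^n}{n!} = \left(\frac{n}{n+1}\right)^{n} = \frac{1}{\left(1+\frac{1}{n}\right)^{n}} \to \frac{1}{e} < 1, \qquad n \to \infty, \]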


and so $n! = o(n^n)$ for $n \to \infty$. To be more precise, one could actually prove the so-called Stirling formula,

\[ n! \sim \sqrt{2\pi n}\,\left(\frac{n}{e}\right)^{n}, \qquad n \to \infty, \]


a helpful approximation of the factorial of large natural numbers. 
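As a quick numerical illustration of the Stirling formula, the sketch below compares $n!$ with $\sqrt{2\pi n}\,(n/e)^n$ for a few values of $n$. The function name `stirling` is an assumption made for this example; the equivalence $\sim$ means that the quotient of the two quantities tends to $1$, which is what the loop prints.

```python
import math


def stirling(n):
    """Stirling approximation sqrt(2*pi*n) * (n/e)^n of n!."""
    return math.sqrt(2 * math.pi * n) * (n / math.e) ** n


# The quotient n! / stirling(n) should approach 1 as n grows.
quotients = {n: math.factorial(n) / stirling(n) for n in (5, 20, 100)}

for n, q in quotients.items():
    print(f"n = {n:3d}   n!/stirling(n) = {q:.6f}")
```

The quotient stays slightly above $1$ and gets closer to $1$ as $n$ increases, in agreement with the asymptotic equivalence.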


5.5 Numerical series 


Consider a segment of length $\ell = 2$ (Fig. 5.2). The middle point splits it into two parts of length $a_0 = \ell/2 = 1$. While keeping the left half fixed, we further subdivide the right one in two parts of length $a_1 = \ell/4 = 1/2$. Iterating the process indefinitely one can think of the initial segment as the union of infinitely many 'left' segments of lengths $1, \frac12, \frac14, \frac18, \frac1{16}, \ldots$ Correspondingly, the total length of the starting segment can be thought of as the sum of the lengths of all sub-segments, in other words

\[ 2 = 1 + \frac12 + \frac14 + \frac18 + \frac1{16} + \ldots \qquad (5.17) \]


On the right we have a sum of infinitely many terms. The notion of infinite sum 
can be defined properly using sequences, and leads to numerical series. 


Figure 5.2. Successive splittings of the interval [0, 2]. The coordinates of the subdivision points are indicated below the blue line, while the lengths of subintervals lie above it




Given the sequence $\{a_k\}_{k\ge 0}$, one constructs the so-called sequence of partial sums $\{s_n\}_{n\ge 0}$ in the following manner:

\[ s_0 = a_0, \qquad s_1 = a_0 + a_1, \qquad s_2 = a_0 + a_1 + a_2, \]

and in general

\[ s_n = a_0 + a_1 + \ldots + a_n = \sum_{k=0}^{n} a_k. \]

Note that $s_n = s_{n-1} + a_n$. Then it is only natural to study the limit behaviour of such a sequence. Let us (formally) define

\[ \sum_{k=0}^{\infty} a_k = \lim_{n\to\infty} \sum_{k=0}^{n} a_k = \lim_{n\to\infty} s_n. \]

The symbol $\sum_{k=0}^{\infty} a_k$ is called (numerical) series, and $a_k$ is the general term of the series.


Definition 5.21 Given the sequence $\{a_k\}_{k\ge 0}$ and $s_n = \sum_{k=0}^{n} a_k$, consider the limit $\lim_{n\to\infty} s_n$.

i) If the limit exists and is finite, we say that the series $\sum_{k=0}^{\infty} a_k$ converges. The value $s$ of the limit is called sum of the series and one writes

\[ s = \sum_{k=0}^{\infty} a_k. \]

ii) If the limit exists and is infinite, we say that the series $\sum_{k=0}^{\infty} a_k$ diverges.

iii) If the limit does not exist, we say that the series $\sum_{k=0}^{\infty} a_k$ is indeterminate.


Examples 5.22 


i) Let us go back to the interval split infinitely many times. The length of the shortest segment obtained after $k + 1$ subdivisions is $a_k = \dfrac{1}{2^k}$, $k \ge 0$. Thus, we consider the series $\sum_{k=0}^{\infty} \dfrac{1}{2^k}$. Its partial sums read

\[ s_0 = 1, \qquad s_1 = 1 + \frac12 = \frac32, \qquad s_2 = 1 + \frac12 + \frac14 = \frac74, \qquad \ldots, \qquad s_n = 1 + \frac12 + \ldots + \frac{1}{2^n}. \]

Using the fact that $a^{n+1} - b^{n+1} = (a - b)(a^n + a^{n-1}b + \ldots + ab^{n-1} + b^n)$, and choosing $a = 1$ and $b = x$ arbitrary but different from one, we obtain the identity

\[ 1 + x + \ldots + x^n = \frac{1 - x^{n+1}}{1 - x}. \qquad (5.18) \]

Therefore

\[ s_n = 1 + \frac12 + \ldots + \frac{1}{2^n} = \frac{1 - \frac{1}{2^{n+1}}}{1 - \frac12} = 2\left(1 - \frac{1}{2^{n+1}}\right) = 2 - \frac{1}{2^n}, \]

and so

\[ \lim_{n\to\infty} s_n = \lim_{n\to\infty} \left(2 - \frac{1}{2^n}\right) = 2. \]

The series converges and its sum is 2. This provides solid ground for having written (5.17) earlier.


ii) Consider the series $\sum_{k=0}^{\infty} k$. Recalling (3.2), we have

\[ s_n = \sum_{k=0}^{n} k = \frac{n(n+1)}{2}. \]

Then

\[ \lim_{n\to\infty} s_n = \lim_{n\to\infty} \frac{n(n+1)}{2} = +\infty, \]

and the series diverges (to $+\infty$).


iii) The partial sums of the series $\sum_{k=0}^{\infty} (-1)^k$ satisfy

\[ s_0 = 1, \qquad s_1 = 1 - 1 = 0, \qquad s_2 = s_1 + 1 = 1, \qquad s_3 = s_2 - 1 = 0, \qquad \ldots, \qquad s_{2n} = 1, \qquad s_{2n+1} = 0. \]

The terms with even index are all equal to 1 while the odd ones are 0. Therefore $\lim_{n\to\infty} s_n$ cannot exist and the series is indeterminate.
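The three behaviours of Definition 5.21 can be seen by tabulating partial sums. The sketch below is only an illustration: the helper `partial_sums` is a hypothetical name, and a finite list of partial sums can suggest, but never prove, the behaviour of a series.

```python
from itertools import accumulate


def partial_sums(term, n_terms):
    """Return [s_0, s_1, ..., s_{n_terms-1}] for the series with general term term(k)."""
    return list(accumulate(term(k) for k in range(n_terms)))


# i) sum of 1/2^k: the partial sums 2 - 1/2^n approach 2 (convergence).
geometric = partial_sums(lambda k: 1 / 2**k, 6)

# ii) sum of k: the partial sums n(n+1)/2 grow without bound (divergence).
linear = partial_sums(lambda k: k, 6)

# iii) sum of (-1)^k: the partial sums oscillate between 1 and 0 (indeterminate).
alternating = partial_sums(lambda k: (-1) ** k, 6)

print(geometric)
print(linear)
print(alternating)
```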


Sometimes the sequence $\{a_k\}$ is only defined for $k \ge k_0$ with $k_0 > 0$; Definition 5.21 then modifies in the obvious way. The following fact holds, whose rather immediate proof is left to the reader.




Property 5.23 The behaviour of a series does not change by adding, changing or removing a finite number of terms.


This property does not tell anything about the sum of a converging series, which 
in general changes by manipulating the terms. For instance 


Examples 5.24 


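i) The series $\sum_{k=2}^{\infty} \dfrac{1}{(k-1)k}$ is called series of Mengoli. As

\[ a_k = \frac{1}{(k-1)k} = \frac{1}{k-1} - \frac{1}{k}, \]

it follows that

\[ s_2 = a_2 = \frac{1}{1\cdot 2} = 1 - \frac12, \qquad s_3 = a_2 + a_3 = \left(1 - \frac12\right) + \left(\frac12 - \frac13\right) = 1 - \frac13, \]

and in general

\[ s_n = a_2 + a_3 + \ldots + a_n = \left(1 - \frac12\right) + \left(\frac12 - \frac13\right) + \ldots + \left(\frac{1}{n-1} - \frac{1}{n}\right) = 1 - \frac{1}{n}. \]

Thus

\[ \lim_{n\to\infty} s_n = \lim_{n\to\infty} \left(1 - \frac{1}{n}\right) = 1 \]

and the series converges to 1.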


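ii) For the series $\sum_{k=1}^{\infty} \log\left(1 + \dfrac{1}{k}\right)$ one has

\[ a_k = \log\left(1 + \frac{1}{k}\right) = \log\frac{k+1}{k} = \log(k+1) - \log k, \]

so

\[ s_1 = \log 2, \]
\[ s_2 = \log 2 + (\log 3 - \log 2) = \log 3, \]
\[ \vdots \]
\[ s_n = \log 2 + (\log 3 - \log 2) + \ldots + (\log(n+1) - \log n) = \log(n+1). \]

Then

\[ \lim_{n\to\infty} s_n = \lim_{n\to\infty} \log(n+1) = +\infty \]

and the series diverges (to $+\infty$).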




The two instances just considered belong to the larger class of telescopic series. These are defined by $a_k = b_{k+1} - b_k$ for a suitable sequence $\{b_k\}_{k\ge k_0}$. Since $s_n = b_{n+1} - b_{k_0}$, the behaviour of a telescopic series is the same as that of the sequence $\{b_k\}$.
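The closed forms of the two telescopic partial sums above can be checked directly. This is a small sketch under the obvious assumptions: the function names are illustrative, exact rational arithmetic (`fractions.Fraction`) is used for the series of Mengoli so that the identity $s_n = 1 - 1/n$ holds without rounding, and floating point is used for the logarithmic series.

```python
import math
from fractions import Fraction


def mengoli_partial_sum(n):
    """s_n = sum_{k=2}^{n} 1/((k-1)k), computed exactly."""
    return sum(Fraction(1, (k - 1) * k) for k in range(2, n + 1))


def log_partial_sum(n):
    """s_n = sum_{k=1}^{n} log(1 + 1/k), computed in floating point."""
    return sum(math.log(1 + 1 / k) for k in range(1, n + 1))


for n in (2, 3, 10, 100):
    # Telescoping gives 1 - 1/n and log(n + 1) respectively.
    print(n, mengoli_partial_sum(n), 1 - Fraction(1, n))
    print(n, log_partial_sum(n), math.log(n + 1))
```

The first sequence of partial sums stays below 1, while the second grows like $\log(n+1)$, which matches the convergence of the first series and the divergence of the second.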

We shall now present a simple yet useful necessary condition for a numerical 
series to converge. 


Property 5.25 Let $\sum_{k=0}^{\infty} a_k$ be a converging series. Then

\[ \lim_{k\to\infty} a_k = 0. \qquad (5.19) \]

Proof. Let $s = \lim_{n\to\infty} s_n$. Since $a_k = s_k - s_{k-1}$, then

\[ \lim_{k\to\infty} a_k = \lim_{k\to\infty} (s_k - s_{k-1}) = s - s = 0, \]

i.e., $\{a_k\}$ is infinitesimal.


Observe that condition (5.19) is not sufficient to guarantee that the series converges. The general term of a series may tend to 0 without the series having to converge. For example we saw that the series $\sum_{k=1}^{\infty} \log\left(1 + \dfrac{1}{k}\right)$ diverges, but at the same time $\lim_{k\to\infty} \log\left(1 + \dfrac{1}{k}\right) = 0$ (Example 5.24 ii)).


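If a series converges to $s$, the quantity

\[ r_n = s - s_n = \sum_{k=n+1}^{\infty} a_k \]

is called nth remainder.

Property 5.26 Take a converging series $\sum_{k=0}^{\infty} a_k$. Then the remainder satisfies

\[ \lim_{n\to\infty} r_n = 0. \]

Proof. Indeed,

\[ \lim_{n\to\infty} r_n = \lim_{n\to\infty} (s - s_n) = s - s = 0. \]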




Example 5.27 


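Consider the geometric series

\[ \sum_{k=0}^{\infty} q^k \]

where $q$ is a fixed number in $\mathbb{R}$.
If $q = 1$ then $s_n = a_0 + a_1 + \ldots + a_n = 1 + 1 + \ldots + 1 = n + 1$ and $\lim_{n\to\infty} s_n = +\infty$, whence the series diverges to $+\infty$.
If $q \ne 1$ instead, (5.18) implies

\[ s_n = 1 + q + q^2 + \ldots + q^n = \frac{1 - q^{n+1}}{1 - q}. \]

Example 5.18 gives

\[ \lim_{n\to\infty} s_n = \lim_{n\to\infty} \frac{1 - q^{n+1}}{1 - q} = \begin{cases} \dfrac{1}{1-q} & \text{if } |q| < 1, \\[2mm] +\infty & \text{if } q > 1, \\[1mm] \text{does not exist} & \text{if } q \le -1. \end{cases} \]

In conclusion,

\[ \sum_{k=0}^{\infty} q^k \quad \begin{cases} \text{converges to } \dfrac{1}{1-q} & \text{if } |q| < 1, \\[2mm] \text{diverges to } +\infty & \text{if } q \ge 1, \\[1mm] \text{is indeterminate} & \text{if } q \le -1. \end{cases} \]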
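The three regimes of the geometric series can be explored with the closed form of the partial sums coming from (5.18). The following sketch is illustrative only; `geometric_partial_sum` is a name chosen for this example.

```python
import math


def geometric_partial_sum(q, n):
    """s_n = 1 + q + ... + q^n, using (5.18) when q != 1."""
    if q == 1:
        return n + 1
    return (1 - q ** (n + 1)) / (1 - q)


# |q| < 1: the partial sums approach 1/(1 - q).
print(geometric_partial_sum(0.5, 60), 1 / (1 - 0.5))

# q >= 1: the partial sums grow without bound.
print([geometric_partial_sum(2, n) for n in (5, 10, 20)])

# q = -1: the partial sums oscillate between 1 and 0.
print([geometric_partial_sum(-1, n) for n in range(6)])
```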


That said, it is not always possible to predict the behaviour of a series $\sum_{k=0}^{\infty} a_k$ using merely the definition. It may well happen that the sequence of partial sums cannot be computed explicitly, so it becomes important to have other ways to establish whether the series converges or not. Only in case of convergence could it be necessary to determine the actual sum. This may require more sophisticated techniques, which go beyond the scope of this text.


5.5.1 Positive-term series 


We deal with series $\sum_{k=0}^{\infty} a_k$ for which $a_k \ge 0$ for any $k \in \mathbb{N}$. The following result holds.

Proposition 5.28 A series $\sum_{k=0}^{\infty} a_k$ with positive terms either converges or diverges to $+\infty$.

Proof. The sequence $s_n$ is monotonically increasing since

\[ s_{n+1} = s_n + a_{n+1} \ge s_n, \qquad \forall n \ge 0. \]

It is then sufficient to use Theorem 3.9 to conclude that $\lim_{n\to\infty} s_n$ exists, and is either finite or $+\infty$.


We now list a few tools for studying the convergence of positive-term series.

Theorem 5.29 (Comparison test) Let $\sum_{k=0}^{\infty} a_k$ and $\sum_{k=0}^{\infty} b_k$ be positive-term series such that $0 \le a_k \le b_k$, for any $k \ge 0$.

i) If the series $\sum_{k=0}^{\infty} b_k$ converges, then also the series $\sum_{k=0}^{\infty} a_k$ converges and

\[ \sum_{k=0}^{\infty} a_k \le \sum_{k=0}^{\infty} b_k. \]

ii) If $\sum_{k=0}^{\infty} a_k$ diverges, then $\sum_{k=0}^{\infty} b_k$ diverges as well.


Proof. i) Denote by $\{s_n\}$ and $\{t_n\}$ the sequences of partial sums of $\sum_{k=0}^{\infty} a_k$, $\sum_{k=0}^{\infty} b_k$ respectively. Since $a_k \le b_k$ for all $k$,

\[ s_n \le t_n, \qquad \forall n \ge 0. \]

By assumption, the series $\sum_{k=0}^{\infty} b_k$ converges, so $\lim_{n\to\infty} t_n = t \in \mathbb{R}$. Proposition 5.28 implies the limit $\lim_{n\to\infty} s_n = s$ exists, finite or infinite. By the First comparison theorem (Theorem 4, p. 137) we have

\[ s = \lim_{n\to\infty} s_n \le \lim_{n\to\infty} t_n = t \in \mathbb{R}. \]

Therefore $s \in \mathbb{R}$, and the series $\sum_{k=0}^{\infty} a_k$ converges. Furthermore $s \le t$.

ii) If the series $\sum_{k=0}^{\infty} b_k$ converged, part i) of this proof would force $\sum_{k=0}^{\infty} a_k$ to converge, too.




Examples 5.30 


i) Consider $\sum_{k=1}^{\infty} \dfrac{1}{k^2}$. Since

\[ \frac{1}{k^2} < \frac{1}{(k-1)k}, \qquad \forall k \ge 2, \]

and the series of Mengoli $\sum_{k=2}^{\infty} \dfrac{1}{(k-1)k}$ converges (Example 5.24 i)), we conclude that our series converges and its sum is at most 2. One could prove that the precise value of the sum is $\pi^2/6$.

ii) The series $\sum_{k=1}^{\infty} \dfrac{1}{k}$ is known as harmonic series. In Chap. 6 (Exercise 12) we shall prove the inequality $\log(1 + x) \le x$, for all $x > -1$, whereby

\[ \log\left(1 + \frac{1}{k}\right) \le \frac{1}{k}, \qquad \forall k \ge 1. \]

Since the series $\sum_{k=1}^{\infty} \log\left(1 + \dfrac{1}{k}\right)$ diverges (Example 5.24 ii)), then also the harmonic series must diverge.
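Both comparisons can be followed numerically. In the sketch below, whose function names are illustrative assumptions, the partial sums of $\sum 1/k^2$ stay below the bound 2 and approach $\pi^2/6$, while the partial sums of the harmonic series dominate $\log(n+1)$, the $n$-th partial sum of the comparison series from Example 5.24 ii).

```python
import math


def inverse_square_partial_sum(n):
    """Partial sum of 1/k^2 for k = 1, ..., n."""
    return sum(1 / k**2 for k in range(1, n + 1))


def harmonic_partial_sum(n):
    """Partial sum of 1/k for k = 1, ..., n."""
    return sum(1 / k for k in range(1, n + 1))


for n in (10, 1000, 10000):
    print(n, inverse_square_partial_sum(n), math.pi**2 / 6)

for n in (10, 1000, 10000):
    # Term-by-term, log(1 + 1/k) <= 1/k, so the sums satisfy the same inequality.
    print(n, harmonic_partial_sum(n), math.log(n + 1))
```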


Here is a useful criterion that generalizes the Comparison test. 


Theorem 5.31 (Asymptotic comparison test) Let $\sum_{k=0}^{\infty} a_k$ and $\sum_{k=0}^{\infty} b_k$ be positive-term series and suppose the sequences $\{a_k\}_{k\ge 0}$ and $\{b_k\}_{k\ge 0}$ have the same order of magnitude for $k \to \infty$. Then the series have the same behaviour.

Proof. Having the same order of magnitude for $k \to \infty$ is equivalent to

\[ \lim_{k\to\infty} \frac{a_k}{b_k} = \ell \in \mathbb{R} \setminus \{0\}. \]

Therefore the sequences $\left\{\dfrac{a_k}{b_k}\right\}_{k\ge 0}$ and $\left\{\dfrac{b_k}{a_k}\right\}_{k\ge 0}$ are both convergent, hence both bounded (Theorem 2, p. 137). So, there must exist constants $M_1, M_2 > 0$ such that

\[ \left|\frac{a_k}{b_k}\right| \le M_1 \qquad \text{and} \qquad \left|\frac{b_k}{a_k}\right| \le M_2 \]

for any $k > 0$, i.e.,

\[ |a_k| \le M_1 |b_k| \qquad \text{and} \qquad |b_k| \le M_2 |a_k|. \]

Now it suffices to use Theorem 5.29 to finish the proof.


Examples 5.32 


_ ~~. k+38 1 
i) Consider » ak = Ss" ae and let 6; = i Then 
k=0 k=0 
li Qk = 1 
ae br 7 


and the given series behaves as the harmonic series, hence diverges. 


ii) Take the series $\sum_{k=1}^{\infty} a_k = \sum_{k=1}^{\infty} \sin\dfrac{1}{k^2}$. As $\sin\dfrac{1}{k^2} \sim \dfrac{1}{k^2}$ for $k \to \infty$, the series has the same behaviour as $\sum_{k=1}^{\infty} \dfrac{1}{k^2}$, so it converges.


Finally, here are two more results, of algebraic flavour and often easy to employ, which provide sufficient conditions for a series to converge or diverge.

Theorem 5.33 (Ratio test) Let $\sum_{k=0}^{\infty} a_k$ have $a_k > 0$, $\forall k \ge 0$. Assume the limit

\[ \lim_{k\to\infty} \frac{a_{k+1}}{a_k} = \ell \]

exists, finite or infinite. If $\ell < 1$ the series converges; if $\ell > 1$ it diverges.


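Proof. First take $\ell$ finite. By definition of limit we know that for any $\varepsilon > 0$, there is an integer $k_\varepsilon \ge 0$ such that for all $k > k_\varepsilon$ one has

\[ \left|\frac{a_{k+1}}{a_k} - \ell\right| < \varepsilon, \quad \text{i.e.,} \quad \ell - \varepsilon < \frac{a_{k+1}}{a_k} < \ell + \varepsilon. \]

Assume $\ell < 1$. Choose $\varepsilon = \dfrac{1-\ell}{2}$ and set $q = \dfrac{1+\ell}{2}$, so

\[ 0 < \frac{a_{k+1}}{a_k} < \ell + \varepsilon = q, \qquad \forall k > k_\varepsilon. \]

Repeating the argument we obtain

\[ a_{k+1} < q\,a_k < q^2 a_{k-1} < \ldots < q^{\,k-k_\varepsilon} a_{k_\varepsilon+1}, \]

hence

\[ a_{k+1} < \frac{a_{k_\varepsilon+1}}{q^{k_\varepsilon}}\, q^k, \qquad \forall k > k_\varepsilon. \]

The claim follows by Theorem 5.29 and from the fact that the geometric series, with $q < 1$, converges (Example 5.27).

Now consider $\ell > 1$. Choose $\varepsilon = \ell - 1$, and notice

\[ 1 = \ell - \varepsilon < \frac{a_{k+1}}{a_k}, \qquad \forall k > k_\varepsilon. \]

Thus $a_{k+1} > a_k > \ldots > a_{k_\varepsilon+1} > 0$, so the necessary condition for convergence fails, for $\lim_{k\to\infty} a_k \ne 0$.

Finally, if $\ell = +\infty$ we put $A = 1$ in the condition of limit, and there exists $k_A \ge 0$ with $\dfrac{a_{k+1}}{a_k} > 1$, for any $k > k_A$. Once again the necessary condition to have convergence does not hold.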


Theorem 5.34 (Root test) Given a series $\sum_{k=0}^{\infty} a_k$ with non-negative terms, suppose

\[ \lim_{k\to\infty} \sqrt[k]{a_k} = \ell \]

exists, finite or infinite. If $\ell < 1$ the series converges, if $\ell > 1$ it diverges.

Proof. Since this proof is essentially identical to the previous one, we leave it to the reader.


Examples 5.35 


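i) For $\sum_{k=0}^{\infty} \dfrac{k}{3^k}$ we have $a_k = \dfrac{k}{3^k}$ and $a_{k+1} = \dfrac{k+1}{3^{k+1}}$, therefore

\[ \lim_{k\to\infty} \frac{a_{k+1}}{a_k} = \lim_{k\to\infty} \frac{1}{3}\,\frac{k+1}{k} = \frac13 < 1. \]

The given series converges by the Ratio test 5.33.

ii) The series $\sum_{k=1}^{\infty} \dfrac{1}{k^k}$ has

\[ \lim_{k\to\infty} \sqrt[k]{a_k} = \lim_{k\to\infty} \frac{1}{k} = 0 < 1. \]

The Root test 5.34 ensures that the series converges.

We remark that the Ratio and Root tests do not allow us to conclude anything if $\ell = 1$. For example, $\sum_{k=1}^{\infty} \dfrac{1}{k}$ diverges and $\sum_{k=1}^{\infty} \dfrac{1}{k^2}$ converges, yet they both satisfy Theorems 5.33 and 5.34 with $\ell = 1$.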
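The quantities appearing in the Ratio and Root tests are easy to tabulate. The sketch below uses illustrative helper names (`ratio`, `kth_root`) and evaluates them at a single large index, which can only hint at the value of the limit $\ell$. It also shows why the case $\ell = 1$ is inconclusive: the harmonic series and $\sum 1/k^2$ both produce quotients close to 1.

```python
def ratio(a, k):
    """Quotient a(k + 1) / a(k) used in the Ratio test."""
    return a(k + 1) / a(k)


def kth_root(a, k):
    """k-th root of a(k) used in the Root test."""
    return a(k) ** (1 / k)


# Example i): a_k = k / 3^k, quotient close to 1/3.
ratio_i = ratio(lambda k: k / 3**k, 200)

# Example ii): a_k = 1 / k^k, k-th root equal to 1/k.
root_ii = kth_root(lambda k: 1 / k**k, 50)

# Inconclusive case: both quotients are close to 1.
ratio_harmonic = ratio(lambda k: 1 / k, 1000)
ratio_inverse_square = ratio(lambda k: 1 / k**2, 1000)

print(ratio_i, root_ii, ratio_harmonic, ratio_inverse_square)
```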


5.5.2 Alternating series 


These are series of the form

\[ \sum_{k=0}^{\infty} (-1)^k b_k \qquad \text{with} \qquad b_k > 0, \quad \forall k \ge 0. \]

For them the following result due to Leibniz holds.

Theorem 5.36 (Leibniz's alternating series test) An alternating series $\sum_{k=0}^{\infty} (-1)^k b_k$ converges if the following conditions hold

i) $\lim_{k\to\infty} b_k = 0$;

ii) the sequence $\{b_k\}_{k\ge 0}$ decreases monotonically.

Denoting by $s$ its sum, for all $n \ge 0$

\[ |r_n| = |s - s_n| \le b_{n+1} \qquad \text{and} \qquad s_{2n+1} \le s \le s_{2n}. \]


Proof. As $\{b_k\}_{k\ge 0}$ is a decreasing sequence, one has

\[ s_{2n} = s_{2n-2} - b_{2n-1} + b_{2n} = s_{2n-2} - (b_{2n-1} - b_{2n}) \le s_{2n-2} \]

and

\[ s_{2n+1} = s_{2n-1} + b_{2n} - b_{2n+1} \ge s_{2n-1}. \]

Thus the subsequence of partial sums made by the terms with even index decreases, whereas the subsequence of terms with odd index increases. For any $n \ge 0$, moreover,

\[ s_{2n} = s_{2n-1} + b_{2n} \ge s_{2n-1} \ge \ldots \ge s_1 \]

and

\[ s_{2n+1} = s_{2n} - b_{2n+1} \le s_{2n} \le \ldots \le s_0. \]

Thus $\{s_{2n}\}_{n\ge 0}$ is bounded from below and $\{s_{2n+1}\}_{n\ge 0}$ from above. By Theorem 3.9 both sequences converge, so let us put

\[ \lim_{n\to\infty} s_{2n} = \inf_{n\ge 0} s_{2n} = s^* \qquad \text{and} \qquad \lim_{n\to\infty} s_{2n+1} = \sup_{n\ge 0} s_{2n+1} = s_*. \]

However, the two limits coincide, since

\[ s^* - s_* = \lim_{n\to\infty} (s_{2n} - s_{2n+1}) = \lim_{n\to\infty} b_{2n+1} = 0; \]

we conclude that the series $\sum_{k=0}^{\infty} (-1)^k b_k$ has sum $s = s^* = s_*$. In addition,

\[ s_{2n+1} \le s \le s_{2n}, \qquad \forall n \ge 0, \]

in other words the sequence $\{s_{2n}\}_{n\ge 0}$ approximates $s$ from above, while $\{s_{2n+1}\}_{n\ge 0}$ approximates $s$ from below. For any $n \ge 0$ we have

\[ 0 \le s - s_{2n+1} \le s_{2n+2} - s_{2n+1} = b_{2n+2} \]

and

\[ 0 \le s_{2n} - s \le s_{2n} - s_{2n+1} = b_{2n+1}, \]

so $|r_n| = |s - s_n| \le b_{n+1}$.


Example 5.37 
Consider the alternating harmonic series $\sum_{k=1}^{\infty} (-1)^k \dfrac{1}{k}$. Given that

\[ \lim_{k\to\infty} b_k = \lim_{k\to\infty} \frac{1}{k} = 0 \]

and the sequence $\left\{\dfrac{1}{k}\right\}_{k\ge 1}$ is strictly monotone decreasing, the series converges.
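The remainder estimate $|s - s_n| \le b_{n+1}$ of Theorem 5.36 can be checked on the alternating harmonic series. The sketch below relies on a fact not established in this section, namely that $\sum_{k=1}^{\infty} (-1)^k/k = -\log 2$; the function name is an illustrative choice, and since the series here starts at $k = 1$ the first partial sum is $s_1$.

```python
import math


def alternating_harmonic_partial_sum(n):
    """s_n = sum_{k=1}^{n} (-1)^k / k."""
    return sum((-1) ** k / k for k in range(1, n + 1))


# Known value of the sum (a standard fact, assumed here rather than proved).
s = -math.log(2)

for n in (1, 2, 10, 11, 1000):
    s_n = alternating_harmonic_partial_sum(n)
    error = abs(s - s_n)
    bound = 1 / (n + 1)  # b_{n+1}, the first omitted term in absolute value
    print(f"n = {n:4d}  s_n = {s_n:+.6f}  |s - s_n| = {error:.6f}  b_(n+1) = {bound:.6f}")
```

The partial sums with odd index lie below $s$ and those with even index lie above it, so consecutive partial sums bracket the sum.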


In order to study series with arbitrary signs it is useful to introduce the notion 
of absolute convergence. 


Definition 5.38 The series $\sum_{k=0}^{\infty} a_k$ converges absolutely if the positive-term series $\sum_{k=0}^{\infty} |a_k|$ converges.


Example 5.39 


The series $\sum_{k=1}^{\infty} (-1)^k \dfrac{1}{k^2}$ converges absolutely because $\sum_{k=1}^{\infty} \dfrac{1}{k^2}$ converges.

The next fact ensures that absolute convergence implies convergence. 




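Theorem 5.40 (Absolute convergence test) If $\sum_{k=0}^{\infty} a_k$ converges absolutely then it also converges and

\[ \left|\sum_{k=0}^{\infty} a_k\right| \le \sum_{k=0}^{\infty} |a_k|. \]

Proof. Let us introduce the sequences

\[ a_k^+ = \begin{cases} a_k & \text{if } a_k \ge 0 \\ 0 & \text{if } a_k < 0 \end{cases} \qquad \text{and} \qquad a_k^- = \begin{cases} 0 & \text{if } a_k \ge 0 \\ -a_k & \text{if } a_k < 0. \end{cases} \]

Notice $a_k^+, a_k^- \ge 0$ for any $k \ge 0$, and

\[ a_k = a_k^+ - a_k^-, \qquad |a_k| = a_k^+ + a_k^-. \]

Since $0 \le a_k^+, a_k^- \le |a_k|$, for any $k \ge 0$, the Comparison test (Theorem 5.29) says that the series $\sum_{k=0}^{\infty} a_k^+$ and $\sum_{k=0}^{\infty} a_k^-$ converge. Observing that

\[ \sum_{k=0}^{n} a_k = \sum_{k=0}^{n} (a_k^+ - a_k^-) = \sum_{k=0}^{n} a_k^+ - \sum_{k=0}^{n} a_k^-, \]

for any $n \ge 0$, we deduce that also the series $\sum_{k=0}^{\infty} a_k = \sum_{k=0}^{\infty} a_k^+ - \sum_{k=0}^{\infty} a_k^-$ converges.
Finally, passing to the limit $n \to \infty$ the relation

\[ \left|\sum_{k=0}^{n} a_k\right| \le \sum_{k=0}^{n} |a_k| \]

yields the desired inequality.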
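The decomposition into positive and negative parts used in the proof can be illustrated on a finite block of terms. This is a sketch only: the helper names and the sample terms $(-1)^k/k^2$ are choices made for the example, and a finite computation does not replace the limit argument.

```python
import math


def positive_part(a):
    """a^+ : equals a if a >= 0, and 0 otherwise."""
    return max(a, 0.0)


def negative_part(a):
    """a^- : equals -a if a < 0, and 0 otherwise."""
    return max(-a, 0.0)


# Sample terms a_k = (-1)^k / k^2 for k = 1, ..., 2000 (an illustrative choice).
terms = [(-1) ** k / k**2 for k in range(1, 2001)]
plus = [positive_part(a) for a in terms]
minus = [negative_part(a) for a in terms]

# a_k = a_k^+ - a_k^-  and  |a_k| = a_k^+ + a_k^-  hold term by term.
print(all(a == p - m for a, p, m in zip(terms, plus, minus)))
print(all(abs(a) == p + m for a, p, m in zip(terms, plus, minus)))

# Finite version of the inequality |sum a_k| <= sum |a_k|.
print(abs(sum(terms)), sum(abs(a) for a in terms))
```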


Remark 5.41 There are series that converge, but not absolutely. The alternating harmonic series $\sum_{k=1}^{\infty} (-1)^k \dfrac{1}{k}$ is one such example, for it has a finite sum, but does not converge absolutely, since the harmonic series $\sum_{k=1}^{\infty} \dfrac{1}{k}$ diverges. In such a situation one speaks about conditional convergence.

The previous criterion allows us to study alternating series by their absolute convergence. As the series of absolute values has positive terms, the criteria seen in Sect. 5.5.1 apply.




5.6 Exercises 


1. Compare the infinitesimals: 


ee = -1, ( (/e—-1)? forz>1 


mor ; a3? for x 4 +00 
£ 


2. Compare the infinite ae 


ja) | a, Vax Qn ii il oe when x — +00 
y, 


b) aaa tloga, 273", 37logx when x + +00 
log x 


3. Verify that $f(x) = \sqrt{x+3} - \sqrt{3}$ and $g(x) = \sqrt{x+5} - \sqrt{5}$ are infinitesimals of the same order for $x \to 0$ and determine $\ell \in \mathbb{R}$ such that $f(x) \sim \ell\, g(x)$ for $x \to 0$.

4. Verify that f(x) = Wx — 2x2 +1 and g(x) = 2x +1 are infinite of the same 
order for x > —oo and determine ¢ € R with f(x) ~ g(x) when x — —ov. 


5. Determine the order and the principal part with respect to y(x) = 1, for 
x — +00, of the infinitesimal functions: 


(a) |e) = b) f(e) =f 5-1 


Sy) nt) =a (VET) [a)] F(a) = tog (9+ sin =) 21083 


6. Determine the order and the principal part with respect to y(x) = 2, for 
x — +00, of the infinite functions: 


2 4 = 1 
la) | f@)=2-Ver +e ) (0) = SS SS 


7. Find the order and the principal part with respect to p(x) = x, for x > 0, of 
the infinitesimal functions: 


la) | F@) (V1 + 3a — 1) sin 22? b) f(a) = Wcosx—1 


1+ 323 


2 
e) f(x) =logcosx f(z) =e? =e tt 




8. Find the order and the principal part with respect to p(x) = x—4%o, for x —> 20, 


of the infinitesimals: 


[2) | F@) = logx —log3, ri =o 
le) | f(@) =e" -e, to= 1 
le) | f(@) =1+ cosa, Lo= 7 


9. Compute the limits: 
aa 1 Vv 1 + 3a? 1 
1m — ey | 
z—0 x2 cos © 


lian log(3 — Vz + 1) 


xL>3 3-2 


b) f(x) = Vz- v2, Lo = 2 
d) f(#)=sing, Cj = 7 


f) f(x) =sin(mcosz), Lo=7 


WE V3? 


eas p= 2 


 _gVF2 _ ev3 
a Gone 


10. Determine domain and asymptotes of the following functions: 


(a) | Fw) = 


(| fa) aes 
7 22 +3 


e) f(x) = (1+ ~) 


11. Study the behaviour of the sequences: 


3 —4n 
ee 


a. ne+2 
[D] a, = aa 


12. Compute the following limits: 


| 

3 
Q 
fo) 
nA 


i n?+1 
a im —— 
noo 27 4 57 


b) f(x) =2+2arctanz 


d) f(z) =ael/lr'-" 


'f) | £@) = log(z + e”) 


nee 
Bb) a= (= 1)” 
) oy = (IP 
! 
d) Soe 
nl 




cos n 


c) lim 
nc Nn 


lim %/3n3 +2 


n> co 


/ i 
8) | lim (; a) 
n—- oo n 


d) lim (1+ (-1)") 


noo 


!_ yy! 
lm (n+ 3)!—n! 


noo n?(n+1)! 


13. Study the convergence of the following positive-term series: 


3 
a) De 2k +1 
k=0 


EP 
a k arcsin a 


14. Study the convergence of the following alternating series: 


(oe) 


15. Study the convergence of: 


De (1 cos 7) 
0 Lal) 


k=1 


sj oy (ae € + 1) 


16. Verify that the following series converge and determine their sum: 


~ (2k + 1)(2k +3) 




5.6.1 Solutions 


1. Comparing infinitesimals: 


a) Since 


UN ai ae 
ma ele @—h@etiy =Wetip 


we have, for x > 1, 


v-1so({/E-a), (fe —1)? =o(4-1). 


Thus we can order the three infinitesimals by increasing order from left to 
right: 
1 


(/—--1, 2-1, (/z-1)°. 
x 


The same result can be attained observing that for x > 1 


= -1= =(9=1)" 


aie oh 


so (\/#—1)? ~ F(a 1)’. 


b) Putting in increasing order we have: 


and 


mg ae se a ae 
a 


2. Comparison of infinite maps: 
a) As 
4 4 1/3 


wo : w 


lim ———— = lim —————= lm ——— 
w—++oo Wall —Qy2 — a>+o0 gll/3 7/1 — 2-9 w&>+00 W1 — Qr-9 


it follows Val! — 2a? = o(ax*) for 2 + +00, so Ya" — 22? is infinite of smaller 


order than x*. 


= +00, 


4 


It is immediate to see that ——_—— = o(a*). Moreover 
log(1 + a) 




Vail — 2x2 log(1 + 2) i log(1+2)V1— 22-9 
SS SS Sl Se 


ous x xt—+00 1/3 
log(1 
= tim MECH) _ 9, 
&%—-++00 x 
4 
that is, V1! — 2272 =o __“___\ Therefore the order increases from left 
log(1 + a) 
to right in 
Vall — 2x? eee x 
’  log(1+ 2)’ 


2 


b) Following the increasing order we have x log z, —, 37 logx, x73". 
og x 


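3. Since

\[ \lim_{x\to 0} \frac{\sqrt{x+3} - \sqrt{3}}{\sqrt{x+5} - \sqrt{5}} = \lim_{x\to 0} \frac{(x+3-3)(\sqrt{x+5} + \sqrt{5})}{(x+5-5)(\sqrt{x+3} + \sqrt{3})} = \lim_{x\to 0} \frac{\sqrt{x+5} + \sqrt{5}}{\sqrt{x+3} + \sqrt{3}} = \sqrt{\frac{5}{3}}, \]

we conclude that $f(x) \sim \sqrt{\frac{5}{3}}\, g(x)$ as $x \to 0$.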


4. The result is f(x) ~ 4 9(x) for 2 4 —oo. 


5. Order of infinitesimal and principal part: 


a) We have 


g49° 2% = 
= lim Ss = lim 22°-?. 
x—+00 x x—+00 


x—+00 Lyx? t—+oo 


D2 5 
fl2) jp a2 
x 


This limit is finite and equals 2 if a = 2. Therefore the order of f(x) is 2 and 
its principal part p(x) = 3. 
Alternatively one could remark that for x — +00, */% = o(x”), so 2x? + {/r ~ 
227 and then f(x) ~ 2x" =e, 

b) This is an infinitesimal of first order with principal part p(x) = —2. 


2x 
c) Note first of all that 


2 —_ 2 
lim (V2?-1-2) = lim eee a, 


x—+00 


hence the function f(x) is infinitesimal for 7 + +o. In addition 


sin (Vx? — 1-2) . siny 
in ~ ————- = lim _ 


1. 
xwr—-+00 V/72—-1—-2 yO y 




Then 
sin (Vx? —l—2z 
lim x sin ( 1-2) = lim 2% (Vx? =1-2) nee o) 
t—-+o0o t—+00 72 —1—-2 
= lim 2% (Ve? —1 —x) : 
t—+00 


One can otherwise observe that sin g(x) ~ g(x) for x > Xo if the function g(a) 
is infinitesimal for x > xo. For > +00 then, 


sin (Va? — 1-2) ~Va2-1l-2@ 
and Proposition 5.5 yields directly 


lim x%sin (V2? — 1-2) = lim x ( m—1-2). 


t—-+o0o L—>-F OO 
Computing the right-hand-side limit yields 
re 
sa Rlhe oe case tp alte 
pee ee . 7 eee e?—-1+2 2—¥+00 ges 1 +1 2 
if aw = 1. Therefore the order is 1 and the principal part reads p(#) = 5. 
d) Consider 


2 1 2 
log (94+sin = — 2log3 = log9 (1 + —sin = — log 9 
at 9 x 


1, 2 
=log{1+-=sin—]. 
9 e 
2 2 


For x — +00 we have }sin2 ~ 2 (see the previous exercise) and log(1+y) ~ y 
for y > 0. So 


1 2 
jin, ORRAS He, at dee & tee Se 
ae ee 6g ee es 


if a = 1. Thus the order of f is 1 and its principal part p(a) = =. 


6. Order of infinite and principal part: 
a) A computation shows 
2-a _ =| 


= lim —~——W———_ =- lim ¢£ 
t—>+too go t—+00 re t—+00 


when a = 2. Then f has order 2 and principal part p(x) = —2?. 
b) The order of f is 1 and the principal part is p(a) = 2z. 




7. Order of infinitesimal and principal part: 


a) First, V1 + 32 —1~ 32 for x > 0, in fact 


. v1l+3c-1 . 2 1432-1 
hin, —— = Ln 2 
x20 zr 20 3 x(/1 + 32+ 1) 


2 
lim ———_- = 1 
a0 afl oa 1 
But sin 2x? ~ 2x? for x > 0, so 
3 2 ; 3 
Fla) ~ 5u- 2a", i.e., f(z) ~ 32°, «0. 


Therefore the order of f is 3 and the principal part is p(x) = 32°. 


b) The order of f is 2 and the principal part is p(w) = — a. 


c) The function f has order 3 and principal part p(x) = —$2°. 


d) Using the relation e” = 1+ 2+ o(x) for x > 0 we have 


for a = 1. The order of f is 1 and the principal part is p(x) = . 


e) The function f has order 2 with principal part p(x) = —$2. 


f) Recalling that 
cosa = 1 — 52? + o(2”) x0, 
V1 = (+09)? = 14 50 + o(0°) x0, 
e=1+t+o(t) +t-0, 
we have 


f(a) = el 32" +0(2”) _ eitge?+o(2*) =" (e-bet tote") _ ete! tale") 


8. Order of infinitesimal and principal part: 


a) Set $t = x - 3$ so that $t \to 0$ when $x \to 3$. Then

\[ \log x - \log 3 = \log(3 + t) - \log 3 = \log 3\left(1 + \frac{t}{3}\right) - \log 3 = \log\left(1 + \frac{t}{3}\right). \]

Since $\log\left(1 + \frac{t}{3}\right) \sim \frac{t}{3}$ for $t \to 0$, it follows

\[ f(x) = \log x - \log 3 \sim \frac13 (x - 3), \qquad x \to 3, \]

hence $f$ is infinitesimal of order 1 and has principal part $p(x) = \frac13 (x - 3)$.


b) The order of f is 1 and the principal part is p(a) = V2 (x — 2). 


1 
3 


c) Remembering that $e^t - 1 \sim t$ as $t \to 0$,

\[ f(x) = e\,(e^{x^2 - 1} - 1) \sim e\,(x^2 - 1) = e\,(x + 1)(x - 1) \sim 2e\,(x - 1) \qquad \text{for } x \to 1. \]

Thus $f$ is infinitesimal of order 1 and has principal part $p(x) = 2e(x - 1)$.
d) The order of $f$ is 1 and the principal part $p(x) = -(x - \pi)$.
e) By setting $t = x - \pi$ it follows that

\[ 1 + \cos x = 1 + \cos(t + \pi) = 1 - \cos t. \]

But $t \to 0$ for $x \to \pi$, so $1 - \cos t \sim \frac12 t^2$ and

\[ f(x) = 1 + \cos x \sim \frac12 (x - \pi)^2, \qquad x \to \pi. \]


9. Limits: 


a) We remind that when x — 0, 


3 1 
V1+322 =1+ xu + o(a?) and cosx = 1— st + o(x”), 


so we have 
i V1+ 322 —cosx 1+ 3x? —1+4 $2? + o(2?) 
240 x? COS x 2-0 2 
9) 2 2 
=f 
xz—0 xr? 




c) Let y =3-—2, so that 


f= Tin log(3 — Vz + 1) 2a Abs log(3 — ./4—y) 
r—+37 3-2 yor Y 
_  log(3 — 2/1 — y/4) 
= lim 
yor 7] 


But since ,/1 — y/4=1— sy +o(y), y > 0, we have 


a log(3 — 2+ 4+ o(y)) log(1 + 4+ o(y)) 


yor Yy yor y 
e+o 1 
— im 2 to) _ 1 
yor Yy 4 


d) Albeit the limit does not exist, the right limit is +oo and the left one —oo. 


10. Domain and asymptotes: 
a) The function is defined for x? — 1 > 0, that is to say 2 < —1 and x > 1; thus 


dom f = (—oo, —1) U (1, +00). It is even, so its behaviour on x < 0 can be 
deduced from x > 0. We have 


2 1 2 
“(1+ 
lim j(2)= 1 (1 + 32) = lim — =+0 
L—>=0O woo || 1 = a Loo || 
i oe lim, f(a) = = + 
Ge et gee gp 
The line x = —1 is a vertical left asymptote and x = 1 is a vertical right 


asymptote; there are no horizontal asymptotes. Let us search for an oblique 
asymptote for x — +oo: 


(x? +1)? — 2442? 
t++oo Vx? — 1(42+1+aVx? — 1) 
3a? +1 _ 8a? 
= lim 


TT TT _4S$—_0_0q>0>——qw = lim 5,3 
TI+00 3 -(1++ — +) t—>+oo 27 


showing that the line y = x is a oblique right asymptote. 
For « — —oo we proceed in a similar way to obtain that the line y = —z is an 
oblique left asymptote. 




b) dom f = R; y = «+7 is an oblique right asymptote, y = x — 7 an oblique left 
asymptote. 


c) The function is defined for z 4 —3, hence dom f = R \ {—3}. Moreover 


a? —(#+1)(2—-2) 207-7 —-2 


2 _ 1)(x—2 > . ofl 
lim f(z)= lim BNE. hee, ee tl 
~—-+00 @—++00 274 +3 t—++oo 27 +3 2 
Bie 1)(2- 4 
2+—3* 2>—3* 22+ 3 = 
making the line y = 4 a horizontal right asymptote and 7 = —3 a vertical 
asymptote. Computing 
2p 
ai F(z) _ Lge eg 
@-0co 6" x2 —0o x(2x + 3) 
—44—-—2 _ 


li i = = 


x 


tells that y = x — 2 is an oblique left asymptote. 


d) dom f = R\ {+1}; « = 41 are vertical asymptotes; the line y = x is a complete 
oblique asymptote. 


e) dom f = (—oo, —1)U(0, +00); horizontal asymptote y = e, vertical left asymp- 
tote « = —1. 

f) The function $f$ is defined for $x + e^x > 0$. To solve this inequality, note that $g(x) = x + e^x$ is a strictly increasing function on $\mathbb{R}$ (as sum of two such maps) satisfying $g(-1) = -1 + \frac1e < 0$ and $g(0) = 1 > 0$. The Theorem of existence of zeroes 4.23 implies that there is a unique point $x_0 \in (-1, 0)$ with $g(x_0) = 0$. Thus $g(x) > 0$ for $x > x_0$ and $\text{dom}\, f = (x_0, +\infty)$. Moreover

\[ \lim_{x\to x_0^+} f(x) = \log \lim_{x\to x_0^+} (x + e^x) = -\infty \qquad \text{and} \qquad \lim_{x\to+\infty} f(x) = +\infty, \]

so $x = x_0$ is a vertical right asymptote and there are no horizontal ones for $x \to +\infty$. For oblique asymptotes consider

\[ \lim_{x\to+\infty} \frac{f(x)}{x} = \lim_{x\to+\infty} \frac{\log e^x (1 + x e^{-x})}{x} = \lim_{x\to+\infty} \frac{x + \log(1 + x e^{-x})}{x} = 1 + \lim_{x\to+\infty} \frac{\log(1 + x e^{-x})}{x} = 1, \]

\[ \lim_{x\to+\infty} (f(x) - x) = \lim_{x\to+\infty} \log(1 + x e^{-x}) = 0 \]

because $\lim_{x\to+\infty} x e^{-x} = 0$ (recalling (5.6) a)). Thus the line $y = x$ is an oblique right asymptote.




11. Sequences behaviour: 
a) Diverges to +00; b) indeterminate. 


c) The geometric sequence (Example 5.18 i)) suggests to consider 


4” ((3)" -1 
lim a, = lim eG) S4) =—-l 
n—-0o n—0o Ania” + 1) 


9 
hence the given sequence converges to —1. 


d) Diverges to +00. 


e) Let us write 


2n(2n—1)---(n+2)(n+1)_ 2% 2-1 n+2 nt+1 
ty = — = ee 
n(n—1)---2-1 n n-1 2 


n+l. 


As lim (n+ 1) = +00, the Second comparison theorem (with infinite limits) 
fetiees (he sequence to diverge to +00. 

f) Converges to 1. 

g) Since 


il 
= exp (vrs ie amills Ss). 


n2+n+2 


we consider the sequence 


Note that 
implies 


Then 
Vn? + 2(2n +1) 5... Bae 
m =— lim 


noo n—0o n2+tn+2 n>co nN 
and the sequence {a,,} converges to e~? 


h) Call 2 = 27"n, so that x + OT for n + oo. Then 


sin x 


lim a, = lim 7 
noo x—0t L 


= T 


and {a,,} converges to 7. 


i) 




Because 


putting z = = has the effect that 


T sin x 


: : . TT T 
lim a, =— lm nsin— =— lim = — 
n—00 n—00 2n z30t 2 £ 2 

Tv 
thus {a,} converges to —$. 
i 


Converges to —5. 


. Limits: 


0. 


Since 4 — 0 for n > oo, 


1 1 1 2 1 1 
Le SSL a eS and LS op 
n 2n n n n n 


so 
lim n eee ee = lm n a id Z ss 
0; d) does not exist. 


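e) Let us write

\[ \sqrt[n]{3n^3 + 2} = \exp\left(\frac{1}{n} \log(3n^3 + 2)\right) \]

and observe

\[ \frac{1}{n} \log(3n^3 + 2) = \frac{\log\left(3n^3\left(1 + \frac{2}{3n^3}\right)\right)}{n} = \frac{\log 3}{n} + \frac{3 \log n}{n} + \frac{\log\left(1 + \frac{2}{3n^3}\right)}{n}. \]

In addition

\[ \lim_{n\to\infty} \frac{\log n}{n} = 0. \]

Thus

\[ \lim_{n\to\infty} \frac{1}{n} \log(3n^3 + 2) = 0 \]

and the required limit is $e^0 = 1$.

f) From

\[ \frac{(n+3)! - n!}{n^2 (n+1)!} = \frac{n!\,\big((n+3)(n+2)(n+1) - 1\big)}{n^2 (n+1)\, n!} = \frac{(n+3)(n+2)(n+1) - 1}{n^2 (n+1)} \]

it follows that

\[ \lim_{n\to\infty} \frac{(n+3)! - n!}{n^2 (n+1)!} = \lim_{n\to\infty} \frac{(n+3)(n+2)(n+1) - 1}{n^2 (n+1)} = 1. \]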




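g) As

\[ \sqrt[3]{1 + \frac{1}{n}} = 1 + \frac{1}{3n} + o\left(\frac{1}{n}\right), \qquad n \to \infty, \]

we have

\[ \lim_{n\to\infty} n \left(\sqrt[3]{1 + \frac{1}{n}} - 1\right) = \lim_{n\to\infty} n \left(\frac{1}{3n} + o\left(\frac{1}{n}\right)\right) = \frac13. \]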
h) 1 


13. Convergence of positive-term series: 


a) Converges. 


b) The general term a, tends to +oo for k — oo. By Property 5.25 the series 
diverges. Alternatively, the Root test 5.34 can be used. 


c) By the Ratio test 5.33 one obtains 


k>00 GE 7 rane (k+1)!3"’ 


Oe ahr) Bl 
lim 


writing (k + 1)! = (k +1)k! and simplifying we see that 


, Ak+1 
lim = tin —— = 
k+00 Gk kao k+1 


and the series converges. 
d) Again with the help of the Ratio test 5.33, we have 


k 

_ Oki1  ,, (kK+1)! KF. k 1 
= lin ee ey ees | 
a eee ee * 


and the series converges. 


e) Notice 


7 
Gn ~ hoe = 5 for k> ow. 


By the Asymptotic behaviour test 5.31, and remembering that the harmonic 
series does not converge, we conclude that the given series must diverge too. 


f) Converges. 


14. Convergence of alternating series: 


a) Converges; b) does not converge. 




c) Since

\[ \sin\left(k\pi + \frac{1}{k}\right) = \cos(k\pi) \sin\frac{1}{k} = (-1)^k \sin\frac{1}{k}, \]

the given series has alternating sign, with $b_k = \sin\frac{1}{k}$. As

\[ \lim_{k\to\infty} b_k = 0 \qquad \text{and} \qquad b_{k+1} < b_k, \]

Leibniz's test 5.36 guarantees convergence. The series does not converge absolutely since $\sin\frac{1}{k} \sim \frac{1}{k}$ for $k \to \infty$, so the series of absolute values behaves as the (diverging) harmonic series.


d) By using one of the equivalences of p. 127 one sees that 


[-» (1+) -1) V2 


~N Fe 5 

Example 5.30 i) suggests to apply the Asymptotic comparison test 5.31 to the 
series of absolute values. We conclude that the given series converges abso- 
lutely. 


k> ow. 


15. Study of convergence: 


a) Converges. 
b) Observe first that 


for allk > 0; 


[o.@) 
the series > a converges and the Comparison test 5.29 tells that the series 
of spe wales converges. Thus the given series converges absolutely. 

c) Diverges. 

d) This is an alternating series where $b_k = \sqrt[k]{2} - 1$. The first term $b_0 = 0$ apart, the sequence $\{b_k\}_{k\ge 1}$ decreases because $\sqrt[k]{2} > \sqrt[k+1]{2}$ for all $k \ge 1$. Thus Leibniz's test 5.36 allows us to conclude that the series converges. Notice that convergence is not absolute, as

\[ \sqrt[k]{2} - 1 = e^{\frac{\log 2}{k}} - 1 \sim \frac{\log 2}{k}, \qquad k \to \infty, \]

confirming that the series of absolute values is like the harmonic series, which diverges.


16. Computing the sum of a converging series: 


a) —2. 




b) Apart from a constant, this is essentially a geometric series; by Example 5.27 
then, 


Se. ry ee 3 
sa -5> (a) =3(--1)-% 
k=1 k=1 16 

(note that the first index in the sum is 1). 
c) The series is telescopic since 


2+1 1 1 
k2(k+1)2 kb? (k+1)2’ 
SO l 
a : 
: (n +1)? 


6 


Differential calculus 


The precise definition of the notion of derivative, studying a function’s differenti- 
ability and computing its successive derivatives, the use of derivatives to analyse 
the local and global behaviours of functions are all constituents of Differential 
Calculus. 


6.1 The derivative 


We start by defining the derivative of a function. 

Let $f : \operatorname{dom} f \subseteq \mathbb{R} \to \mathbb{R}$ be a real function of one real variable, take $x_0 \in \operatorname{dom} f$ and suppose $f$ is defined in a neighbourhood $I_r(x_0)$ of $x_0$. With $x \in I_r(x_0)$, $x \ne x_0$ fixed, denote by

\[ \Delta x = x - x_0 \]

the (positive or negative) increment of the independent variable between $x_0$ and $x$, and by

\[ \Delta f = f(x) - f(x_0) \]

the corresponding increment of the dependent variable. Note that $x = x_0 + \Delta x$, $f(x) = f(x_0) + \Delta f$.
The ratio

\[ \frac{\Delta f}{\Delta x} = \frac{f(x) - f(x_0)}{x - x_0} = \frac{f(x_0 + \Delta x) - f(x_0)}{\Delta x} \]

is called difference quotient of $f$ between $x_0$ and $x$.


In this manner $\Delta f$ represents the absolute increment of the dependent variable $f$ when passing from $x_0$ to $x_0 + \Delta x$, whereas the difference quotient detects the rate of increment (while $\Delta f / f$ is the relative increment). Multiplying the difference quotient by 100 we obtain the so-called percentage increment. Suppose a rise by $\Delta x = 0.2$ of the variable $x$ prompts an increment $\Delta f = 0.06$ of $f$; the difference quotient $\frac{\Delta f}{\Delta x}$ equals $0.3 = \frac{30}{100}$, corresponding to a 30% increase.
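The arithmetic of the last example can be replayed in a few lines. The following is a minimal sketch: the helper name `difference_quotient` and the affine test function `g` are illustrative assumptions, not notation from the text.

```python
def difference_quotient(f, x0, dx):
    """Return (f(x0 + dx) - f(x0)) / dx, the rate of increment of f."""
    if dx == 0:
        raise ValueError("dx must be non-zero")
    return (f(x0 + dx) - f(x0)) / dx


# Numbers from the text: a rise dx = 0.2 prompting an increment df = 0.06.
dx, df = 0.2, 0.06
rate = df / dx               # difference quotient
percentage = 100 * rate      # percentage increment

# Hypothetical affine map with slope 0.3: it reproduces the same quotient.
g = lambda x: 0.3 * x + 1.0
q = difference_quotient(g, 2.0, 0.2)
```

Here `rate` and `q` both come out as 0.3 up to rounding, and `percentage` as 30.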


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_6, 
© Springer International Publishing Switzerland 2015 




Figure 6.1. Secant and tangent lines to the graph of f at Po 


Graphically, the difference quotient between $x_0$ and a point $x_1$ around $x_0$ is the slope of the straight line $s$ passing through $P_0 = (x_0, f(x_0))$ and $P_1 = (x_1, f(x_1))$, points that belong to the graph of the function; this line is called secant of the graph of $f$ at $P_0$ and $P_1$ (Fig. 6.1). Putting $\Delta x = x_1 - x_0$ and $\Delta f = f(x_1) - f(x_0)$, the equation of the secant line reads

\[ y = s(x) = f(x_0) + \frac{\Delta f}{\Delta x}(x - x_0), \qquad x \in \mathbb{R}. \tag{6.1} \]

A typical application of the difference quotient comes from physics. Let $M$ be a point-particle moving along a straight line; call $s = s(t)$ the $x$-coordinate of the position of $M$ at time $t$, with respect to a reference point $O$. Between the instants $t_0$ and $t_1 = t_0 + \Delta t$, the particle changes position by $\Delta s = s(t_1) - s(t_0)$. The difference quotient $\frac{\Delta s}{\Delta t}$ represents the average velocity of the particle in the given interval of time.


How does the difference quotient change, as Ax approaches 0? This is answered 
by the following notion. 


Definition 6.1 A map $f$ defined on a neighbourhood of $x_0 \in \mathbb{R}$ is called differentiable at $x_0$ if the limit of the difference quotient $\frac{\Delta f}{\Delta x}$ between $x_0$ and $x$ exists and is finite, as $x$ approaches $x_0$. The real number

\[ f'(x_0) = \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0} = \lim_{\Delta x \to 0} \frac{f(x_0 + \Delta x) - f(x_0)}{\Delta x} \]

is called (first) derivative of $f$ at $x_0$.




The derivative at $x_0$ is variously denoted, for instance also by

\[ y'(x_0), \qquad \frac{df}{dx}(x_0), \qquad Df(x_0). \]

The first symbol goes back to Newton, the second is associated to Leibniz.
From the geometric point of view $f'(x_0)$ is the slope of the tangent line at $P_0 = (x_0, f(x_0))$ to the graph of $f$: such line $t$ is obtained as the limiting position of the secant $s$ at $P_0$ and $P = (x, f(x))$, when $P$ approaches $P_0$. From (6.1) and the previous definition we have

\[ y = t(x) = f(x_0) + f'(x_0)(x - x_0), \qquad x \in \mathbb{R}. \]

In the physical example given above, the derivative $v(t_0) = s'(t_0) = \lim_{\Delta t \to 0} \frac{\Delta s}{\Delta t}$ is the instantaneous velocity of the particle $M$ at time $t_0$.

Let

\[ \operatorname{dom} f' = \{ x \in \operatorname{dom} f : f \text{ is differentiable at } x \} \]

and define the function $f' : \operatorname{dom} f' \subseteq \mathbb{R} \to \mathbb{R}$, $f' : x \mapsto f'(x)$, mapping $x \in \operatorname{dom} f'$ to the value of the derivative of $f$ at $x$. This map is called (first) derivative of $f$.


Definition 6.2 Let I be a subset of dom f. We say that f is differentiable 


on I (or in I) if f is differentiable at each point of I. 


A first yet significant property of differentiable maps is the following. 


Proposition 6.3 If f is differentiable at xo, it is also continuous at xo. 


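Proof. Continuity at $x_0$ prescribes

\[ \lim_{x \to x_0} f(x) = f(x_0), \quad\text{that is}\quad \lim_{x \to x_0} \big(f(x) - f(x_0)\big) = 0. \]

If $f$ is differentiable at $x_0$, then

\[ \lim_{x \to x_0} \big(f(x) - f(x_0)\big) = \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0}\,(x - x_0) = \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0} \cdot \lim_{x \to x_0} (x - x_0) = f'(x_0) \cdot 0 = 0. \]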




Not all continuous maps at a point are differentiable though. Consider the map $f(x) = |x|$: it is continuous at the origin, yet the difference quotient between the origin and a point $x \ne 0$ is

\[ \frac{\Delta f}{\Delta x} = \frac{f(x) - f(0)}{x - 0} = \frac{|x|}{x} = \begin{cases} +1 & \text{if } x > 0, \\ -1 & \text{if } x < 0, \end{cases} \tag{6.2} \]

so the limit for $x \to 0$ does not exist. Otherwise said, $f$ is not differentiable at the origin. This particular example shows that the implication of Proposition 6.3 cannot be reversed: differentiability is thus a stronger property than continuity, an aspect to which Sect. 6.3 is entirely devoted.
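The failure of differentiability of $|x|$ at the origin can also be observed numerically. The sketch below evaluates the difference quotient of the absolute value at $0$ for shrinking increments of both signs; the helper name and the chosen step sizes are assumptions made for illustration.

```python
def difference_quotient(f, x0, dx):
    """Difference quotient of f between x0 and x0 + dx (dx != 0)."""
    return (f(x0 + dx) - f(x0)) / dx


# Increments 10^-1, ..., 10^-7 approaching 0 from the right and from the left.
steps = [10.0 ** (-k) for k in range(1, 8)]
right = [difference_quotient(abs, 0.0, h) for h in steps]
left = [difference_quotient(abs, 0.0, -h) for h in steps]
```

Every entry of `right` equals 1 and every entry of `left` equals -1, so the two one-sided behaviours never agree, however small the increment.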


6.2 Derivatives of the elementary functions. Rules of 
differentiation 


We begin by tackling the issue of differentiability for elementary functions using 
Definition 6.1. 
i) Consider the affine map $f(x) = ax + b$, and let $x_0 \in \mathbb{R}$ be arbitrary. Then

\[ f'(x_0) = \lim_{\Delta x \to 0} \frac{\big(a(x_0 + \Delta x) + b\big) - (a x_0 + b)}{\Delta x} = \lim_{\Delta x \to 0} a = a, \]

in agreement with the fact that the graph of $f$ is a straight line of slope $a$. The derivative of $f(x) = ax + b$ is then the constant map $f'(x) = a$.
In particular if $f$ is constant ($a = 0$), its derivative is identically zero.


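ii) Take $f(x) = x^2$ and $x_0 \in \mathbb{R}$. Since

\[ f'(x_0) = \lim_{\Delta x \to 0} \frac{(x_0 + \Delta x)^2 - x_0^2}{\Delta x} = \lim_{\Delta x \to 0} (2x_0 + \Delta x) = 2x_0, \]

the derivative of $f(x) = x^2$ is the function $f'(x) = 2x$.

iii) Now let $f(x) = x^n$ with $n \in \mathbb{N}$. The binomial formula (1.13) yields

\[
\begin{aligned}
f'(x_0) &= \lim_{\Delta x \to 0} \frac{(x_0 + \Delta x)^n - x_0^n}{\Delta x} \\
&= \lim_{\Delta x \to 0} \frac{x_0^n + n x_0^{n-1} \Delta x + \sum_{k=2}^{n} \binom{n}{k} x_0^{n-k} (\Delta x)^k - x_0^n}{\Delta x} \\
&= \lim_{\Delta x \to 0} \left( n x_0^{n-1} + \sum_{k=2}^{n} \binom{n}{k} x_0^{n-k} (\Delta x)^{k-1} \right) = n x_0^{n-1}
\end{aligned}
\]

for all $x_0 \in \mathbb{R}$. Therefore, $f'(x) = n x^{n-1}$ is the derivative of $f(x) = x^n$.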




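iv) Even more generally, consider $f(x) = x^\alpha$ where $\alpha \in \mathbb{R}$, and let $x_0 \ne 0$ be a point of the domain. Then

\[
f'(x_0) = \lim_{\Delta x \to 0} \frac{(x_0 + \Delta x)^\alpha - x_0^\alpha}{\Delta x}
= \lim_{\Delta x \to 0} \frac{x_0^\alpha \left[ \left(1 + \frac{\Delta x}{x_0}\right)^\alpha - 1 \right]}{\Delta x}
= x_0^{\alpha - 1} \lim_{\Delta x \to 0} \frac{\left(1 + \frac{\Delta x}{x_0}\right)^\alpha - 1}{\frac{\Delta x}{x_0}}.
\]

Substituting $y = \frac{\Delta x}{x_0}$ brings the latter into the form of the fundamental limit (4.13), so

\[ f'(x_0) = \alpha x_0^{\alpha - 1}. \]

When $\alpha > 1$, $f$ is differentiable at $x_0 = 0$ as well, and $f'(0) = 0$. The function $f(x) = x^\alpha$ is thus differentiable at all points where the expression $x^{\alpha - 1}$ is well defined; its derivative is $f'(x) = \alpha x^{\alpha - 1}$.

For example $f(x) = \sqrt{x} = x^{1/2}$, defined on $[0, +\infty)$, is differentiable on $(0, +\infty)$ with derivative $f'(x) = \frac{1}{2\sqrt{x}}$. The function $f(x) = \sqrt[3]{x^5} = x^{5/3}$ is defined on $\mathbb{R}$, where it is also differentiable, and $f'(x) = \frac{5}{3} x^{2/3} = \frac{5}{3} \sqrt[3]{x^2}$.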


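v) Now consider the trigonometric functions. Take $f(x) = \sin x$ and $x_0 \in \mathbb{R}$. Formula (2.14) gives

\[
f'(x_0) = \lim_{\Delta x \to 0} \frac{\sin(x_0 + \Delta x) - \sin x_0}{\Delta x}
= \lim_{\Delta x \to 0} \frac{2 \sin\frac{\Delta x}{2} \cos\left(x_0 + \frac{\Delta x}{2}\right)}{\Delta x}
= \lim_{\Delta x \to 0} \frac{\sin\frac{\Delta x}{2}}{\frac{\Delta x}{2}} \cdot \lim_{\Delta x \to 0} \cos\left(x_0 + \frac{\Delta x}{2}\right).
\]

The limit (4.5) and the cosine’s continuity tell

\[ f'(x_0) = \cos x_0. \]

Hence the derivative of $f(x) = \sin x$ is $f'(x) = \cos x$.
Using in a similar way formula (2.15), we can see that the derivative of $f(x) = \cos x$ is the function $f'(x) = -\sin x$.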


vi) Finally, consider the exponential function $f(x) = a^x$. By (4.12) we have

\[ f'(x_0) = \lim_{\Delta x \to 0} \frac{a^{x_0 + \Delta x} - a^{x_0}}{\Delta x} = a^{x_0} \lim_{\Delta x \to 0} \frac{a^{\Delta x} - 1}{\Delta x} = a^{x_0} \log a, \]

showing that the derivative of $f(x) = a^x$ is $f'(x) = (\log a)\, a^x$.

As $\log e = 1$, the derivative of $f(x) = e^x$ is $f'(x) = e^x = f(x)$, whence the derivative $f'$ coincides at each point with the function $f$ itself. This is a crucial fact, and a reason for choosing $e$ as privileged base for the exponential map.
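The derivatives obtained in i)-vi) can be sanity-checked numerically. The sketch below compares each closed form with a symmetric difference quotient at one sample point; the symmetric quotient, the step `h`, the sample point `x0` and all names are assumptions chosen for the illustration, not part of the text.

```python
import math


def central_difference(f, x, h=1e-6):
    """Symmetric difference quotient (f(x + h) - f(x - h)) / (2h)."""
    return (f(x + h) - f(x - h)) / (2 * h)


x0 = 0.7  # arbitrary sample point inside every domain used below

# Each entry pairs a function with the derivative stated in the text.
checks = {
    "x^3": (lambda x: x ** 3, lambda x: 3 * x ** 2),
    "sqrt(x)": (math.sqrt, lambda x: 1 / (2 * math.sqrt(x))),
    "sin x": (math.sin, math.cos),
    "cos x": (math.cos, lambda x: -math.sin(x)),
    "2^x": (lambda x: 2.0 ** x, lambda x: math.log(2.0) * 2.0 ** x),
    "e^x": (math.exp, math.exp),
}

# Absolute gap between the numerical quotient and the closed-form derivative.
errors = {
    name: abs(central_difference(f, x0) - df(x0))
    for name, (f, df) in checks.items()
}
```

All the gaps stored in `errors` are tiny, which is consistent with the formulas; a numerical agreement at one point is of course only a plausibility check, not a proof.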




We next discuss differentiability in terms of operations (algebraic operations, 
composition, inversion) on functions. We shall establish certain differentiation 
rules to compute derivatives of functions that are built from the elementary ones, 
without resorting to the definition each time. The proofs may be found in Ap- 
pendix A.4.1, p. 449. 


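Theorem 6.4 (Algebraic operations) Let $f(x)$, $g(x)$ be differentiable maps at $x_0 \in \mathbb{R}$. Then the maps $f(x) \pm g(x)$, $f(x)g(x)$ and, if $g(x_0) \ne 0$, $\frac{f(x)}{g(x)}$ are differentiable at $x_0$. To be precise,

\[ (f \pm g)'(x_0) = f'(x_0) \pm g'(x_0), \tag{6.3} \]

\[ (fg)'(x_0) = f'(x_0)\,g(x_0) + f(x_0)\,g'(x_0), \tag{6.4} \]

\[ \left(\frac{f}{g}\right)'(x_0) = \frac{f'(x_0)\,g(x_0) - f(x_0)\,g'(x_0)}{[g(x_0)]^2}. \tag{6.5} \]


Corollary 6.5 (Linearity of the derivative) If $f(x)$ and $g(x)$ are differentiable at $x_0 \in \mathbb{R}$, the map $\alpha f(x) + \beta g(x)$ is differentiable at $x_0$ for any $\alpha, \beta \in \mathbb{R}$ and

\[ (\alpha f + \beta g)'(x_0) = \alpha f'(x_0) + \beta g'(x_0). \tag{6.6} \]


Proof. Consider (6.4) and recall that differentiating a constant gives zero; then $(\alpha f)'(x_0) = \alpha f'(x_0)$ and $(\beta g)'(x_0) = \beta g'(x_0)$ follow. The rest is a consequence of (6.3). □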


Examples 6.6


i) To differentiate a polynomial, we use the fact that $D x^n = n x^{n-1}$ and apply the corollary repeatedly. So, $f(x) = 3x^5 - 2x^4 - x^3 + 3x^2 - 5x + 2$ differentiates to

\[ f'(x) = 3 \cdot 5x^4 - 2 \cdot 4x^3 - 3x^2 + 3 \cdot 2x - 5 = 15x^4 - 8x^3 - 3x^2 + 6x - 5. \]

ii) For rational functions, we compute the numerator and denominator’s derivatives and then employ rule (6.5), to the effect that

\[ f(x) = \frac{x^2 - 3x + 1}{2x - 1} \]

has derivative

\[ f'(x) = \frac{(2x - 3)(2x - 1) - (x^2 - 3x + 1) \cdot 2}{(2x - 1)^2} = \frac{2x^2 - 2x + 1}{4x^2 - 4x + 1}. \]




iii) Consider $f(x) = x^3 \sin x$. The product rule (6.4) together with $(\sin x)' = \cos x$ yield

\[ f'(x) = 3x^2 \sin x + x^3 \cos x. \]

iv) The function

\[ f(x) = \tan x = \frac{\sin x}{\cos x} \]

can be differentiated with (6.5):

\[ f'(x) = \frac{\cos x \cos x - \sin x\,(-\sin x)}{\cos^2 x} = \frac{\cos^2 x + \sin^2 x}{\cos^2 x} = 1 + \frac{\sin^2 x}{\cos^2 x} = 1 + \tan^2 x. \]

Another possibility is to use $\cos^2 x + \sin^2 x = 1$ to obtain

\[ f'(x) = \frac{1}{\cos^2 x}. \]
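The term-by-term differentiation of a polynomial, as in Example 6.6 i), is mechanical enough to be scripted. What follows is a small sketch in which a polynomial is stored as a list of coefficients, lowest degree first; this representation and the name `poly_derivative` are assumptions made for the illustration.

```python
def poly_derivative(coeffs):
    """Differentiate a polynomial given by its coefficients.

    coeffs[k] is the coefficient of x**k. The rule D x^k = k x^(k-1),
    combined with linearity, turns c_k x^k into k c_k x^(k-1).
    """
    return [k * c for k, c in enumerate(coeffs)][1:]


# f(x) = 3x^5 - 2x^4 - x^3 + 3x^2 - 5x + 2, lowest degree first.
f_coeffs = [2, -5, 3, -1, -2, 3]
df_coeffs = poly_derivative(f_coeffs)
```

The list `df_coeffs` reads `[-5, 6, -3, -8, 15]`, i.e. the polynomial $15x^4 - 8x^3 - 3x^2 + 6x - 5$; a constant polynomial is mapped to the empty list, which stands for the zero polynomial in this representation.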


Theorem 6.7 (“Chain rule”) Let $f(x)$ be differentiable at $x_0 \in \mathbb{R}$ and $g(y)$ a differentiable map at $y_0 = f(x_0)$. Then the composition $g \circ f(x) = g(f(x))$ is differentiable at $x_0$ and

\[ (g \circ f)'(x_0) = g'(y_0)\, f'(x_0) = g'(f(x_0))\, f'(x_0). \tag{6.7} \]


Examples 6.8
i) The map $h(x) = \sqrt{1 - x^2}$ is the composite of $f(x) = 1 - x^2$, whose derivative is $f'(x) = -2x$, and $g(y) = \sqrt{y}$, for which $g'(y) = \frac{1}{2\sqrt{y}}$. Then (6.7) directly gives

\[ h'(x) = \frac{1}{2\sqrt{1 - x^2}}\,(-2x) = -\frac{x}{\sqrt{1 - x^2}}. \]

ii) The function $h(x) = e^{\cos 3x}$ is composed by $f(x) = \cos 3x$, $g(y) = e^y$. But $f(x)$ is in turn the composite of $\varphi(x) = 3x$ and $\psi(y) = \cos y$; thus (6.7) tells $f'(x) = -3 \sin 3x$. On the other hand $g'(y) = e^y$. Using (6.7) once again we conclude

\[ h'(x) = -3 e^{\cos 3x} \sin 3x. \]
□
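The result of Example 6.8 ii) can be cross-checked against a numerical quotient. This is a minimal sketch assuming a symmetric difference quotient with a fixed small step and a handful of arbitrarily chosen sample points; none of these choices comes from the text.

```python
import math


def h(x):
    """h(x) = e^(cos 3x), the composite map of Example 6.8 ii)."""
    return math.exp(math.cos(3 * x))


def h_prime(x):
    """Chain rule: g'(f(x)) * f'(x) with f(x) = cos 3x and g(y) = e^y."""
    return math.exp(math.cos(3 * x)) * (-3 * math.sin(3 * x))


def central_difference(f, x, step=1e-6):
    """Symmetric difference quotient (f(x + step) - f(x - step)) / (2 step)."""
    return (f(x + step) - f(x - step)) / (2 * step)


points = [-1.0, 0.0, 0.4, 2.5]  # arbitrary sample points
max_error = max(abs(central_difference(h, x) - h_prime(x)) for x in points)
```

The value of `max_error` stays far below the size of the derivative itself, as one would expect if the chain rule has been applied correctly.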


Theorem 6.9 (Derivative of the inverse function) Suppose $f(x)$ is a continuous, invertible map on a neighbourhood of $x_0 \in \mathbb{R}$, and differentiable at $x_0$, with $f'(x_0) \ne 0$. Then the inverse map $f^{-1}(y)$ is differentiable at $y_0 = f(x_0)$, and

\[ (f^{-1})'(y_0) = \frac{1}{f'(x_0)} = \frac{1}{f'(f^{-1}(y_0))}. \tag{6.8} \]




Examples 6.10


i) The function $y = f(x) = \tan x$ has derivative $f'(x) = 1 + \tan^2 x$ and inverse $x = f^{-1}(y) = \arctan y$. By (6.8)

\[ (f^{-1})'(y) = \frac{1}{1 + \tan^2 x} = \frac{1}{1 + y^2}. \]

Setting for simplicity $f^{-1} = g$ and denoting the independent variable with $x$, the derivative of $g(x) = \arctan x$ is the function $g'(x) = \frac{1}{1 + x^2}$.

ii) We are by now acquainted with the function $y = f(x) = \sin x$: it is invertible on $[-\frac{\pi}{2}, \frac{\pi}{2}]$, namely $x = f^{-1}(y) = \arcsin y$. Moreover, $f$ differentiates to $f'(x) = \cos x$. Using $\cos^2 x + \sin^2 x = 1$, and taking into account that on that interval $\cos x \ge 0$, one can write the derivative of $f$ in the equivalent form $f'(x) = \sqrt{1 - \sin^2 x}$. Now (6.8) yields

\[ (f^{-1})'(y) = \frac{1}{\sqrt{1 - \sin^2 x}} = \frac{1}{\sqrt{1 - y^2}}. \]

Put once again $f^{-1} = g$ and change names to the variables: the derivative of $g(x) = \arcsin x$ is $g'(x) = \frac{1}{\sqrt{1 - x^2}}$.
In similar fashion $g(x) = \arccos x$ differentiates to $g'(x) = -\frac{1}{\sqrt{1 - x^2}}$.

iii) Consider $y = f(x) = a^x$. It has derivative $f'(x) = (\log a)\, a^x$ and inverse $x = f^{-1}(y) = \log_a y$. The usual (6.8) gives

\[ (f^{-1})'(y) = \frac{1}{(\log a)\, a^x} = \frac{1}{(\log a)\, y}. \]

Defining $f^{-1} = g$ and renaming $x$ the independent variable gives $g'(x) = \frac{1}{(\log a)\, x}$ as derivative of $g(x) = \log_a x$ ($x > 0$).
Take now $h(x) = \log_a(-x)$ (with $x < 0$), composition of $x \mapsto -x$ and $g(y)$: then

\[ h'(x) = \frac{1}{(\log a)(-x)}\,(-1) = \frac{1}{(\log a)\, x}. \]

Putting all together shows that $g(x) = \log_a |x|$ ($x \ne 0$) has derivative $g'(x) = \frac{1}{(\log a)\, x}$.
With the choice of base $a = e$ the derivative of $g(x) = \log |x|$ is $g'(x) = \frac{1}{x}$. □
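The rule for the inverse function lends itself to a direct numerical comparison: evaluate $1 / f'(f^{-1}(y))$ and compare with the closed forms derived in Examples 6.10. The sketch below does this for the arctangent and the arcsine; the helper `inverse_derivative` and the sample values are hypothetical names and choices, introduced only for this check.

```python
import math


def inverse_derivative(f_prime, f_inverse, y):
    """Return 1 / f'(f^-1(y)), the derivative of the inverse map at y."""
    x = f_inverse(y)
    slope = f_prime(x)
    if slope == 0:
        raise ZeroDivisionError("f' vanishes at f^-1(y): the rule does not apply")
    return 1.0 / slope


# f(x) = tan x, with f'(x) = 1 + tan^2 x and inverse arctan.
tan_prime = lambda x: 1 + math.tan(x) ** 2
values = [-2.0, 0.0, 0.5, 3.0]
atan_from_rule = [inverse_derivative(tan_prime, math.atan, y) for y in values]
atan_closed_form = [1 / (1 + y * y) for y in values]

# f(x) = sin x on [-pi/2, pi/2], with f'(x) = cos x and inverse arcsin.
asin_from_rule = inverse_derivative(math.cos, math.asin, 0.5)
asin_closed_form = 1 / math.sqrt(1 - 0.5 ** 2)
```

The two lists agree entry by entry up to rounding, and so do the two arcsine values.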


Remark 6.11 Let $f(x)$ be differentiable and strictly positive on an interval $I$. Due to the previous result and the Chain rule, the derivative of the composite map $g(x) = \log f(x)$ is

\[ g'(x) = \frac{f'(x)}{f(x)}. \]

The expression $\frac{f'}{f}$ is said logarithmic derivative of the map $f$.




The section ends with a useful corollary to the Chain rule 6.7. 


Property 6.12 If $f$ is an even (or odd) differentiable function on all its domain, the derivative $f'$ is odd (resp. even).


Proof. Since $f$ is even, $f(-x) = f(x)$ for any $x \in \operatorname{dom} f$. Let us differentiate both sides. As $f(-x)$ is the composition of $x \mapsto -x$ and $y \mapsto f(y)$, its derivative reads $-f'(-x)$. Then $f'(-x) = -f'(x)$ for all $x \in \operatorname{dom} f$, so $f'$ is odd. Similarly if $f$ is odd.


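We reckon it could be useful to collect the derivatives of the main elementary functions in one table, for reference.

\[
\begin{aligned}
D\, x^\alpha &= \alpha x^{\alpha - 1} \qquad (\forall \alpha \in \mathbb{R}) \\
D \sin x &= \cos x \\
D \cos x &= -\sin x \\
D \tan x &= 1 + \tan^2 x = \frac{1}{\cos^2 x} \\
D \arcsin x &= \frac{1}{\sqrt{1 - x^2}} \\
D \arccos x &= -\frac{1}{\sqrt{1 - x^2}} \\
D \arctan x &= \frac{1}{1 + x^2} \\
D\, a^x &= (\log a)\, a^x \qquad \text{in particular, } D\, e^x = e^x \\
D \log_a |x| &= \frac{1}{(\log a)\, x} \qquad \text{in particular, } D \log |x| = \frac{1}{x}
\end{aligned}
\]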


6.3 Where differentiability fails 


It was noted earlier that the function $f(x) = |x|$ is continuous but not differentiable at the origin. At each other point of the real line $f$ is differentiable, for it coincides with the line $y = x$ when $x > 0$, and with $y = -x$ for $x < 0$. Therefore $f'(x) = +1$ for $x > 0$ and $f'(x) = -1$ on $x < 0$. The reader will recall the sign function (Example 2.1 iv)), for which

\[ D\,|x| = \operatorname{sign}(x), \qquad \text{for all } x \ne 0. \]

The origin is an isolated point of non-differentiability for $y = |x|$.
Returning to the expression (6.2) for the difference quotient at the origin, we observe that the one-sided limits exist and are finite:

\[ \lim_{x \to 0^+} \frac{\Delta f}{\Delta x} = 1, \qquad \lim_{x \to 0^-} \frac{\Delta f}{\Delta x} = -1. \]

This fact suggests introducing the following notion.


Definition 6.13 Suppose $f$ is defined on a right neighbourhood of $x_0 \in \mathbb{R}$. It is called differentiable on the right at $x_0$ if the right limit of the difference quotient $\frac{\Delta f}{\Delta x}$ between $x_0$ and $x$ exists finite, for $x$ approaching $x_0$. The real number

\[ f'_+(x_0) = \lim_{x \to x_0^+} \frac{f(x) - f(x_0)}{x - x_0} = \lim_{\Delta x \to 0^+} \frac{f(x_0 + \Delta x) - f(x_0)}{\Delta x} \]

is the right (or backward) derivative of $f$ at $x_0$. Similarly it goes for the left (or forward) derivative $f'_-(x_0)$.


If $f$ is defined only on a right (resp. left) neighbourhood of $x_0$ and is differentiable on the right (resp. the left) at $x_0$, we shall simply say that $f$ is differentiable at $x_0$, and write $f'(x_0) = f'_+(x_0)$ (resp. $f'(x_0) = f'_-(x_0)$).

From Proposition 3.24 the following criterion is immediate. 


Property 6.14 A map $f$ defined around a point $x_0 \in \mathbb{R}$ is differentiable at $x_0$ if and only if it is differentiable on both sides at $x_0$ and the left and right derivatives coincide, in which case

\[ f'(x_0) = f'_+(x_0) = f'_-(x_0). \]


Instead, if $f$ is differentiable at $x_0$ on the left and on the right, but the two derivatives are different (as for $f(x) = |x|$ at the origin), $x_0$ is called corner (point) for $f$ (Fig. 6.2). The term originates in the geometric observation that the right derivative of $f$ at $x_0$ represents the slope of the right tangent to the graph of $f$ at $P_0 = (x_0, f(x_0))$, i.e., the limiting position of the secant through $P_0$ and $P = (x, f(x))$ as $x > x_0$ approaches $x_0$. In case the right and left tangent (similarly defined) do not coincide, they form an angle at $P_0$.




Figure 6.2. Non-differentiable maps: the origin is a corner point (left), a point with 
vertical tangent (middle), a cusp (right) 


Other interesting cases occur when the right and left limits of the difference quotient of $f$ at $x_0$ exist, but one at least is not finite. These will be still denoted by $f'_+(x_0)$ and $f'_-(x_0)$.

Precisely, if just one of $f'_+(x_0)$, $f'_-(x_0)$ is infinite, we still say that $x_0$ is a corner point for $f$.

If both $f'_+(x_0)$ and $f'_-(x_0)$ are infinite and with same sign (hence the limit of the difference quotient is $+\infty$ or $-\infty$), $x_0$ is a point with vertical tangent for $f$. This is the case for $f(x) = \sqrt[3]{x}$:

\[ f'_\pm(0) = \lim_{x \to 0^\pm} \frac{\sqrt[3]{x}}{x} = \lim_{x \to 0^\pm} \frac{1}{\sqrt[3]{x^2}} = +\infty. \]

When $f'_+(x_0)$, $f'_-(x_0)$ are infinite and have different signs, $x_0$ is called a cusp (point) of $f$. For instance the map $f(x) = \sqrt{|x|}$ has a cusp at the origin, for

\[ f'_\pm(0) = \lim_{x \to 0^\pm} \frac{\sqrt{|x|}}{x} = \lim_{x \to 0^\pm} \frac{\sqrt{|x|}}{\operatorname{sign}(x)\,|x|} = \lim_{x \to 0^\pm} \frac{1}{\operatorname{sign}(x)\sqrt{|x|}} = \pm\infty. \]

Another criterion for differentiability at a point $x_0$ is up next. The proof is deferred to Sect. 6.11, for it relies on de l’Hôpital’s Theorem.


Theorem 6.15 Let $f$ be continuous at $x_0$ and differentiable at all points $x \ne x_0$ in a neighbourhood of $x_0$. Then $f$ is differentiable at $x_0$ provided that the limit of $f'(x)$ for $x \to x_0$ exists finite. If so,

\[ f'(x_0) = \lim_{x \to x_0} f'(x). \]


Example 6.16


We take the function

\[ f(x) = \begin{cases} a \sin 2x - 4 & \text{if } x < 0, \\ b(x - 1) + e^x & \text{if } x \ge 0, \end{cases} \]

and ask ourselves whether there are real numbers $a$ and $b$ rendering $f$ differentiable at the origin. The continuity at the origin (recall: differentiable implies continuous) forces the two values

\[ \lim_{x \to 0^-} f(x) = -4, \qquad \lim_{x \to 0^+} f(x) = f(0) = -b + 1 \]

to agree, hence $b = 5$. With $b$ fixed, we may impose the equality of the right and left limits of $f'(x)$ for $x \to 0$, to the effect that $f'(x)$ admits finite limit for $x \to 0$. Then we use Theorem 6.15, which prescribes that

\[ \lim_{x \to 0^-} f'(x) = \lim_{x \to 0^-} 2a \cos 2x = 2a \quad\text{and}\quad \lim_{x \to 0^+} f'(x) = \lim_{x \to 0^+} (5 + e^x) = 6 \]

are the same, so $a = 3$.
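The values found in Example 6.16 can be tested numerically by comparing the one-sided difference quotients at the origin. The sketch below hard-codes $a = 3$ and $b = 5$ as default arguments; the step size `h` and the variable names are assumptions made for the check.

```python
import math


def f(x, a=3.0, b=5.0):
    """Piecewise map of Example 6.16 with the values a = 3, b = 5."""
    if x < 0:
        return a * math.sin(2 * x) - 4
    return b * (x - 1) + math.exp(x)


h = 1e-6  # small increment, an arbitrary choice

# One-sided difference quotients at the origin.
left_quotient = (f(-h) - f(0.0)) / (-h)
right_quotient = (f(h) - f(0.0)) / h

# Continuity check: the value just left of 0 against f(0) = -4.
jump = abs(f(-1e-12) - f(0.0))
```

Both quotients come out close to 6 and `jump` is negligible, in line with $f'(0) = 6$.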


Remark 6.17 In using Theorem 6.15 one should not forget to impose continuity at the point $x_0$. The mere existence of the limit for $f'$ is not enough to guarantee $f$ will be differentiable at $x_0$. For example, $f(x) = x + \operatorname{sign} x$ is differentiable at every $x \ne 0$: since $f'(x) = 1$, it necessarily follows $\lim_{x \to 0} f'(x) = 1$. The function is nonetheless not differentiable, because not continuous, at $x = 0$.


6.4 Extrema and critical points 


Definition 6.18 One calls $x_0 \in \operatorname{dom} f$ a relative (or local) maximum point for $f$ if there is a neighbourhood $I_r(x_0)$ of $x_0$ such that

\[ \forall x \in I_r(x_0) \cap \operatorname{dom} f, \qquad f(x) \le f(x_0). \]

Then $f(x_0)$ is a relative (or local) maximum of $f$.
One calls $x_0$ an absolute maximum point (or global maximum point) for $f$ if

\[ \forall x \in \operatorname{dom} f, \qquad f(x) \le f(x_0), \]

and $f(x_0)$ becomes the (absolute) maximum of $f$. In either case, the maximum is said strict if $f(x) < f(x_0)$ when $x \ne x_0$.


Exchanging the symbols $\le$, $<$ with $\ge$, $>$ one obtains the definitions of relative and absolute minimum point. A minimum or maximum point shall be referred to generically as an extremum (point) of $f$.


Examples 6.19


i) The parabola $f(x) = 1 + 2x - x^2 = 2 - (x - 1)^2$ has a strict absolute maximum point at $x_0 = 1$, and 2 is the function’s absolute maximum. Notice the derivative $f'(x) = 2(1 - x)$ is zero at that point. There are no minimum points (relative or absolute).


Figure 6.3. Types of maxima


ii) For $g(x) = \arcsin x$ (see Fig. 2.24), $x_0 = 1$ is a strict absolute maximum point, with maximum value $\frac{\pi}{2}$. The point $x_1 = -1$ is a strict absolute minimum, with value $-\frac{\pi}{2}$. At these extrema $g$ is not differentiable.


We are interested in finding the extremum points of a given function. Provided 
the latter is differentiable, it might be useful to look for the points where the first 
derivative vanishes. 


Definition 6.20 A critical point (or stationary point) of f is a point xo 


at which f is differentiable with derivative f’(xo) = 0. 


The tangent at a critical point is horizontal. 


Figure 6.4. Types of critical points 


Theorem 6.21 (Fermat) Suppose $f$ is defined in a full neighbourhood of a point $x_0$ and differentiable at $x_0$. If $x_0$ is an extremum point, then it is critical for $f$, i.e.,

\[ f'(x_0) = 0. \]




Proof. To fix ideas, assume $x_0$ is a relative maximum point and that $I_r(x_0)$ is a neighbourhood where $f(x) \le f(x_0)$ for all $x \in I_r(x_0)$. On such neighbourhood then $\Delta f = f(x) - f(x_0) \le 0$.

If $x > x_0$, hence $\Delta x = x - x_0 > 0$, the difference quotient $\frac{\Delta f}{\Delta x}$ is non-positive. Corollary 4.3 implies

\[ \lim_{x \to x_0^+} \frac{f(x) - f(x_0)}{x - x_0} \le 0. \]

Vice versa, if $x < x_0$, i.e., $\Delta x < 0$, then $\frac{\Delta f}{\Delta x}$ is non-negative, so

\[ \lim_{x \to x_0^-} \frac{f(x) - f(x_0)}{x - x_0} \ge 0. \]

By Property 6.14,

\[ f'(x_0) = \lim_{x \to x_0^+} \frac{f(x) - f(x_0)}{x - x_0} = \lim_{x \to x_0^-} \frac{f(x) - f(x_0)}{x - x_0}, \]

so $f'(x_0)$ is simultaneously $\le 0$ and $\ge 0$, hence zero.
A similar argument holds for relative minima.


Fermat’s Theorem 6.21 ensures that the extremum points of a differentiable 
map which belong to the interior of the domain should be searched for among 
critical points. 

A function can nevertheless have critical points that are not extrema, as in Fig. 6.4. The map $f(x) = x^3$ has the origin as a critical point ($f'(x) = 3x^2 = 0$ if and only if $x = 0$), but admits no extremum since it is strictly increasing on the whole $\mathbb{R}$.

At the same time though, a function may have non-critical extremum points (Fig. 6.3); this happens when a function is not differentiable at an extremum that lies inside the domain (e.g. $f(x) = |x|$, whose absolute minimum is attained at the origin), or when the extremum point is on the boundary (as in Example 6.19 ii)). The upshot is that in order to find all extrema of a function, browsing through the critical points might not be sufficient.


To summarise, extremum points are contained among the points of the domain 
at which either 


i) the first derivative vanishes, 


ii) or the function is not differentiable, 


iii) or among the domain’s boundary points (inside R). 
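The three categories just listed suggest a simple search procedure on a closed bounded interval: gather the candidates of each kind and compare the values of the function there. The sketch below applies it to a hypothetical example that is not taken from the text, $f(x) = |x^2 - 1|$ on $[-2, 3]$; the candidate points are worked out by hand and entered as data, an assumption stated in the comments.

```python
def f(x):
    """Hypothetical example: f(x) = |x^2 - 1| on the interval [-2, 3]."""
    return abs(x * x - 1)


# Candidates, determined by hand for this particular f:
#  - on (-1, 1) we have f(x) = 1 - x^2, so f'(x) = -2x vanishes at x = 0;
#  - f is not differentiable where x^2 - 1 changes sign, i.e. at x = -1, 1;
#  - the boundary points of the interval are x = -2 and x = 3.
candidates = {
    "critical": [0.0],
    "non-differentiable": [-1.0, 1.0],
    "boundary": [-2.0, 3.0],
}

# Value of f at every candidate point.
values = {x: f(x) for group in candidates.values() for x in group}

x_max = max(values, key=values.get)
minimum = min(values.values())
x_min = sorted(x for x, v in values.items() if v == minimum)
```

For this example the maximum 8 is attained at the boundary point 3, while the minimum 0 is attained at the two points of non-differentiability; the critical point 0 is only a relative maximum, so inspecting critical points alone would have missed both absolute extrema.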


6.5 Theorems of Rolle, Lagrange, and Cauchy 


We begin this section by presenting two theorems, Rolle’s Theorem and Lagrange’s 
or Mean Value Theorem, that are fundamental for the study of differentiable maps 
on an interval. 


Theorem 6.22 (Rolle) Let $f$ be a function defined on a closed bounded interval $[a, b]$, continuous on $[a, b]$ and differentiable on $(a, b)$ (at least). If $f(a) = f(b)$, there exists an $x_0 \in (a, b)$ such that

\[ f'(x_0) = 0. \]

In other words, $f$ admits at least one critical point in $(a, b)$.


Proof. By the Theorem of Weierstrass the range $f([a, b])$ is the closed interval $[m, M]$ bounded by the minimum and maximum values $m$, $M$ of the map:

\[ m = \min_{x \in [a,b]} f(x) = f(x_m), \qquad M = \max_{x \in [a,b]} f(x) = f(x_M), \]

for suitable $x_m, x_M \in [a, b]$.
In case $m = M$, $f$ is constant on $[a, b]$, so in particular $f'(x) = 0$ for any $x \in (a, b)$ and the theorem follows.

Suppose then $m < M$. Since $m \le f(a) = f(b) \le M$, one of the strict inequalities $f(a) = f(b) < M$, $m < f(a) = f(b)$ will hold.

If $f(a) = f(b) < M$, the absolute maximum point $x_M$ cannot be $a$ nor $b$; thus, $x_M \in (a, b)$ is an interior extremum point at which $f$ is differentiable. By Fermat’s Theorem 6.21 we have that $x_M = x_0$ is a critical point.

If $m < f(a) = f(b)$, one proves analogously that $x_m$ is the critical point $x_0$ of the claim.


The theorem proves the existence of one critical point in (a, b); Fig. 6.5 shows that 
there could actually be more. 




Figure 6.5. Rolle’s Theorem 




Theorem 6.23 (Mean Value Theorem or Lagrange Theorem) Let $f$ be defined on the closed and bounded interval $[a, b]$, continuous on $[a, b]$ and differentiable (at least) on $(a, b)$. Then there is a point $x_0 \in (a, b)$ such that

\[ \frac{f(b) - f(a)}{b - a} = f'(x_0). \tag{6.9} \]

Every such point $x_0$ we shall call Lagrange point for $f$ in $(a, b)$.


Proof. Introduce an auxiliary map

\[ g(x) = f(x) - \frac{f(b) - f(a)}{b - a}\,(x - a) \]

defined on $[a, b]$. It is continuous on $[a, b]$ and differentiable on $(a, b)$, as difference of $f$ and an affine map, which is differentiable on all of $\mathbb{R}$. Note

\[ g'(x) = f'(x) - \frac{f(b) - f(a)}{b - a}. \]

It is easily seen that

\[ g(a) = f(a), \qquad g(b) = f(a), \]

so Rolle’s Theorem applies to $g$, with the consequence that there is a point $x_0 \in (a, b)$ satisfying

\[ g'(x_0) = f'(x_0) - \frac{f(b) - f(a)}{b - a} = 0. \]

But this is exactly (6.9).


Figure 6.6. Lagrange point for f in (a,b) 




The meaning of the Mean Value Theorem is clarified in Fig. 6.6. At each Lagrange point, the tangent to the graph of $f$ is parallel to the secant line passing through the points $(a, f(a))$ and $(b, f(b))$.


Example 6.24


Consider $f(x) = 1 + x + \sqrt{1 - x^2}$, a continuous map on its domain $[-1, 1]$ as composite of elementary continuous functions. It is also differentiable on the open interval $(-1, 1)$ (not at the end-points), in fact

\[ f'(x) = 1 - \frac{x}{\sqrt{1 - x^2}}. \]

Thus $f$ fulfills the Mean Value Theorem’s hypotheses, and must admit a Lagrange point in $(-1, 1)$. Now (6.9) becomes

\[ 1 = \frac{f(1) - f(-1)}{1 - (-1)} = f'(x_0) = 1 - \frac{x_0}{\sqrt{1 - x_0^2}}, \]

satisfied by $x_0 = 0$.
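When equation (6.9) cannot be solved by hand, a Lagrange point may be approximated numerically. As an illustration on the function of Example 6.24, the sketch below locates a zero of $f'(x) - \frac{f(b) - f(a)}{b - a}$ by bisection; the bisection routine, the bracketing interval $[-0.9, 0.5]$ and the tolerance are assumptions chosen for the demonstration, not a method prescribed by the text.

```python
import math


def f(x):
    return 1 + x + math.sqrt(1 - x * x)


def f_prime(x):
    return 1 - x / math.sqrt(1 - x * x)


def bisect(g, lo, hi, tol=1e-12):
    """Approximate a zero of g in [lo, hi]; g(lo), g(hi) must not share a sign."""
    g_lo = g(lo)
    if g_lo * g(hi) > 0:
        raise ValueError("no sign change on the interval")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        g_mid = g(mid)
        if g_mid == 0:
            return mid
        if g_lo * g_mid < 0:
            hi = mid              # the zero lies in [lo, mid]
        else:
            lo, g_lo = mid, g_mid  # the zero lies in [mid, hi]
    return (lo + hi) / 2


a, b = -1.0, 1.0
mean_slope = (f(b) - f(a)) / (b - a)

# Bracket strictly inside (-1, 1), where f' is defined.
x0 = bisect(lambda x: f_prime(x) - mean_slope, -0.9, 0.5)
```

The computed `mean_slope` is 1 and `x0` lands within the tolerance of the exact Lagrange point $0$.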


The following result is a generalisation of the Mean Value Theorem 6.23 (which is recovered by taking $g(x) = x$ in its statement). It will be useful during the proofs of de l’Hôpital’s Theorem 6.41 and Taylor’s formula with Lagrange’s remainder (Theorem 7.2).


Theorem 6.25 (Cauchy) Let $f$ and $g$ be maps defined on the closed, bounded interval $[a, b]$, continuous on $[a, b]$ and differentiable (at least) on $(a, b)$. Suppose $g'(x) \ne 0$ for all $x \in (a, b)$. Then there exists $x_0 \in (a, b)$ such that

\[ \frac{f(b) - f(a)}{g(b) - g(a)} = \frac{f'(x_0)}{g'(x_0)}. \tag{6.10} \]


Proof. Note first that $g(a) \ne g(b)$, otherwise Rolle’s Theorem would have $g'(x)$ vanish somewhere in $(a, b)$, against the assumption.
Take the function

\[ h(x) = f(x) - \frac{f(b) - f(a)}{g(b) - g(a)}\,\big(g(x) - g(a)\big) \]

defined on $[a, b]$. It is continuous on $[a, b]$ and differentiable on the open interval $(a, b)$, because difference of maps with those properties. Moreover $h(a) = h(b) = f(a)$, so the map $h$ satisfies the hypotheses of Rolle’s Theorem, and there must be a point $x_0 \in (a, b)$ with

\[ h'(x_0) = f'(x_0) - \frac{f(b) - f(a)}{g(b) - g(a)}\, g'(x_0) = 0, \]

which is exactly (6.10). □


6.6 First and second finite increment formulas 


We shall discuss a couple of useful relations to represent how a function varies when passing from one point to another of its domain.
Let us begin by assuming $f$ is differentiable at $x_0$. By definition

\[ \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0} = f'(x_0), \]

that is to say

\[ \lim_{x \to x_0} \left( \frac{f(x) - f(x_0)}{x - x_0} - f'(x_0) \right) = \lim_{x \to x_0} \frac{f(x) - f(x_0) - f'(x_0)(x - x_0)}{x - x_0} = 0. \]

Using the Landau symbols of Sect. 5.1, this becomes

\[ f(x) - f(x_0) - f'(x_0)(x - x_0) = o(x - x_0), \qquad x \to x_0. \]

An equivalent formulation is

\[ f(x) - f(x_0) = f'(x_0)(x - x_0) + o(x - x_0), \qquad x \to x_0, \tag{6.11} \]

or

\[ \Delta f = f'(x_0)\,\Delta x + o(\Delta x), \qquad \Delta x \to 0, \tag{6.12} \]

by putting $\Delta x = x - x_0$ and $\Delta f = f(x) - f(x_0)$.

Equations (6.11)-(6.12) are equivalent writings of what we call the first formula of the finite increment, the geometric interpretation of which can be found in Fig. 6.7. It tells that if $f'(x_0) \ne 0$, the increment $\Delta f$, corresponding to a change $\Delta x$, is proportional to $\Delta x$ itself, if one disregards an infinitesimal which is negligible with respect to $\Delta x$. For $\Delta x$ small enough, in practice, $\Delta f$ can be treated as $f'(x_0)\,\Delta x$.
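The meaning of the remainder $o(\Delta x)$ in the first formula of the finite increment can be made tangible with a small experiment: the gap between $\Delta f$ and $f'(x_0)\,\Delta x$, divided by $\Delta x$, should shrink as $\Delta x \to 0$. The sketch below uses $f(x) = \sin x$ at $x_0 = 1$; the function, the point and the sequence of steps are arbitrary choices made for illustration.

```python
import math

x0 = 1.0
steps = [10.0 ** (-k) for k in range(1, 6)]  # dx = 0.1, 0.01, ..., 0.00001

ratios = []
for dx in steps:
    increment = math.sin(x0 + dx) - math.sin(x0)   # Delta f
    linear_part = math.cos(x0) * dx                # f'(x0) * Delta x
    # |Delta f - f'(x0) Delta x| / |Delta x| should tend to 0 with dx.
    ratios.append(abs(increment - linear_part) / dx)
```

The entries of `ratios` decrease roughly by a factor of ten at each step, which is the numerical footprint of a remainder that is negligible with respect to $\Delta x$.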


Now take $f$ continuous on an interval $I$ of $\mathbb{R}$ and differentiable on the interior points. Fix $x_1 < x_2$ in $I$ and note that $f$ is continuous on $[x_1, x_2]$ and differentiable on $(x_1, x_2)$. Therefore $f$, restricted to $[x_1, x_2]$, satisfies the Mean Value Theorem, so there is $\bar{x} \in (x_1, x_2)$ such that

\[ \frac{f(x_2) - f(x_1)}{x_2 - x_1} = f'(\bar{x}), \]



Figure 6.7. First formula of the finite increment 


that is, a point $\bar{x} \in (x_1, x_2)$ with

\[ f(x_2) - f(x_1) = f'(\bar{x})\,(x_2 - x_1). \tag{6.13} \]

We shall refer to this relation as the second formula of the finite increment. It has to be noted that the point $\bar{x}$ depends upon the choice of $x_1$ and $x_2$, albeit this dependency is in general not explicit. The formula’s relevance derives from the possibility of gaining information about the increment $f(x_2) - f(x_1)$ from the behaviour of $f'$ on the interval $[x_1, x_2]$.

The second formula of the finite increment may be used to describe the local behaviour of a map in the neighbourhood of a certain $x_0$ with more precision than that permitted by the first formula. Suppose $f$ is continuous at $x_0$ and differentiable around $x_0$ except possibly at the point itself. If $x$ is a point in the neighbourhood of $x_0$, (6.13) can be applied to the interval bounded by $x_0$ and $x$, to the effect that

\[ \Delta f = f'(\bar{x})\,\Delta x, \tag{6.14} \]

where $\bar{x}$ lies between $x_0$ and $x$. This alternative formulation of (6.13) expresses the increment of the dependent variable $\Delta f$ as if it were a multiple of $\Delta x$; at closer look though, one realises that the proportionality coefficient, i.e., the derivative evaluated at a point near $x_0$, depends upon $\Delta x$ (and on $x_0$), besides being usually not known.


A further application of (6.13) is described in the next result. This will be 
useful later. 




Property 6.26 A function defined on a real interval I and everywhere differ- 


entiable is constant on I if and only if its first derivative vanishes identically. 


Proof. Let $f$ be the map. Suppose first $f$ is constant, therefore for every $x_0 \in I$, the difference quotient $\frac{f(x) - f(x_0)}{x - x_0}$, with $x \in I$, $x \ne x_0$, is zero. Then $f'(x_0) = 0$ by definition of derivative.
Vice versa, suppose $f$ has zero derivative on $I$ and let us prove that $f$ is constant on $I$. This would be equivalent to demanding

\[ f(x_1) = f(x_2), \qquad \forall x_1, x_2 \in I. \]

Take $x_1, x_2 \in I$ and use formula (6.13) on $f$. For a suitable $\bar{x}$ between $x_1$, $x_2$, we have

\[ f(x_2) - f(x_1) = f'(\bar{x})\,(x_2 - x_1) = 0, \]

thus $f(x_1) = f(x_2)$.


6.7 Monotone maps 


In the light of the results on differentiability, we tackle the issue of monotonicity. 


Theorem 6.27 Let $I$ be an interval upon which the map $f$ is differentiable. Then:

a) If $f$ is increasing on $I$, then $f'(x) \ge 0$ for all $x \in I$.

b1) If $f'(x) \ge 0$ for any $x \in I$, then $f$ is increasing on $I$;
b2) if $f'(x) > 0$ for all $x \in I$, then $f$ is strictly increasing on $I$.


Proof. Let us prove claim a). Suppose $f$ increasing on $I$ and consider an interior point $x_0$ of $I$. For all $x \in I$ such that $x < x_0$, we have

\[ f(x) - f(x_0) \le 0 \quad\text{and}\quad x - x_0 < 0. \]

Thus, the difference quotient $\frac{\Delta f}{\Delta x}$ between $x_0$ and $x$ is non-negative. On the other hand, for any $x \in I$ with $x > x_0$,

\[ f(x) - f(x_0) \ge 0 \quad\text{and}\quad x - x_0 > 0. \]

Here too the difference quotient $\frac{\Delta f}{\Delta x}$ between $x_0$ and $x$ is positive or zero. Altogether, Corollary 4.3 on

\[ \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0} = f'(x_0) \]

yields $f'(x_0) \ge 0$. As for the possible extremum points in $I$, we arrive at the same conclusion by considering one-sided limits of the difference quotient, which is always $\ge 0$.

Now to the implications in parts b). Take $f$ with $f'(x) \ge 0$ for all $x \in I$. The idea is to fix points $x_1 < x_2$ in $I$ and prove that $f(x_1) \le f(x_2)$. For that we use (6.13) and note that $f'(\bar{x}) \ge 0$ by assumption. But since $x_2 - x_1 > 0$, we have

\[ f(x_2) - f(x_1) = f'(\bar{x})\,(x_2 - x_1) \ge 0, \]

proving b1). Considering $f$ such that $f'(x) > 0$ for all $x \in I$ instead, (6.13) implies $f(x_2) - f(x_1) > 0$, hence also b2) holds. □


The theorem asserts that if $f$ is differentiable on $I$, the following logic equivalence holds:

\[ f'(x) \ge 0, \ \forall x \in I \quad\Longleftrightarrow\quad f \text{ is increasing on } I. \]

Furthermore,

\[ f'(x) > 0, \ \forall x \in I \quad\Longrightarrow\quad f \text{ is strictly increasing on } I. \]

The latter implication is not reversible: $f$ strictly increasing on $I$ does not imply $f'(x) > 0$ for all $x \in I$. We have elsewhere observed that $f(x) = x^3$ is everywhere strictly increasing, despite having vanishing derivative at the origin.


A similar statement to the above holds if we change the word ‘increasing’ with ‘decreasing’ and the symbols $\ge$, $>$ with $\le$, $<$.




Corollary 6.28 Let $f$ be differentiable on $I$ and $x_0$ an interior critical point. If $f'(x) \ge 0$ at the left of $x_0$ and $f'(x) \le 0$ at its right, then $x_0$ is a maximum point for $f$. Similarly, $f'(x) \le 0$ at the left, and $\ge 0$ at the right of $x_0$ implies $x_0$ is a minimum point.


Theorem 6.27 and Corollary 6.28 justify the search for extrema among the 
zeroes of f’, and explain why the derivative’s sign affects monotonicity intervals. 


Example 6.29

The map $f : \mathbb{R} \to \mathbb{R}$, $f(x) = x e^{2x}$ differentiates to $f'(x) = (2x + 1)e^{2x}$, whence
$x_0 = -\frac{1}{2}$ is the sole critical point. As $f'(x) > 0$ if and only if $x > -\frac{1}{2}$, $f(x_0)$ is an
absolute minimum. The function is strictly decreasing on $(-\infty, -\frac{1}{2}]$ and strictly
increasing on $[-\frac{1}{2}, +\infty)$.
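The conclusions of Example 6.29 can be checked numerically. The following is only an illustrative sketch: the helper names `f`, `fprime` and `sign_of_derivative` and the sampling grid are assumptions introduced here, not part of the text. It evaluates the sign of $f'(x) = (2x+1)e^{2x}$ on both sides of $x_0 = -\frac{1}{2}$ and locates the smallest sampled value of $f$.

```python
import math

def f(x):
    # The map of Example 6.29
    return x * math.exp(2 * x)

def fprime(x):
    # Derivative computed by hand: f'(x) = (2x + 1) e^{2x}
    return (2 * x + 1) * math.exp(2 * x)

def sign_of_derivative(x):
    # Returns -1, 0 or +1 according to the sign of f'(x)
    d = fprime(x)
    return (d > 0) - (d < 0)

# Sample f on a grid centred at the critical point x0 = -1/2
critical_point = -0.5
samples = [critical_point + 0.01 * k for k in range(-300, 301)]
min_x = min(samples, key=f)

print("sign of f' at x = -1:", sign_of_derivative(-1.0))
print("sign of f' at x =  0:", sign_of_derivative(0.0))
print("sampled minimum point:", min_x)
```

A grid search of this kind only suggests where the minimum lies; the proof remains the sign analysis of $f'$ given above.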


6.8 Higher-order derivatives 


Let $f$ be differentiable around $x_0$ and let its first derivative $f'$ be also defined
around $x_0$.

Definition 6.30 If $f'$ is a differentiable function at $x_0$, one says $f$ is twice
differentiable at $x_0$. The expression

$$f''(x_0) = (f')'(x_0)$$

is called second derivative of $f$ at $x_0$. The second derivative of $f$,
denoted $f''$, is the map associating to $x$ the number $f''(x)$, provided the latter
is defined.


Other notations commonly used for the second derivative include

$$y''(x_0), \qquad \frac{d^2 f}{dx^2}(x_0), \qquad D^2 f(x_0).$$

The third derivative, where defined, is the derivative of the second derivative:

$$f'''(x_0) = (f'')'(x_0).$$

In general, for any $k \ge 1$, the derivative of order $k$ ($k$th derivative) of $f$ at
$x_0$ is the first derivative, where defined, of the derivative of order $(k - 1)$ of $f$ at
$x_0$:

$$f^{(k)}(x_0) = \left(f^{(k-1)}\right)'(x_0).$$


Alternative symbols are:

$$y^{(k)}(x_0), \qquad \frac{d^k f}{dx^k}(x_0), \qquad D^k f(x_0).$$

For convenience one defines $f^{(0)}(x_0) = f(x_0)$ as well.


Examples 6.31
We compute the derivatives of all orders for three elementary functions.

i) Choose $n \in \mathbb{N}$ and consider $f(x) = x^n$. Then

$$f'(x) = n x^{n-1}, \qquad f''(x) = n(n-1)x^{n-2}, \qquad \ldots, \qquad f^{(n)}(x) = n(n-1)\cdots 2 \cdot 1 = n!.$$

More concisely,

$$f^{(k)}(x) = \frac{n!}{(n-k)!}\, x^{n-k}$$

with $0 \le k \le n$. Furthermore, $f^{(n+1)}(x) = 0$ for any $x \in \mathbb{R}$ (the derivative of
the constant function $f^{(n)}(x)$ is 0), and consequently all derivatives $f^{(k)}$ of order
$k > n$ exist and vanish identically.

ii) The sine function $f(x) = \sin x$ satisfies $f'(x) = \cos x$, $f''(x) = -\sin x$,
$f'''(x) = -\cos x$ and $f^{(4)}(x) = \sin x$. Successive derivatives of $f$ clearly
reproduce this cyclical pattern. The same phenomenon occurs for $y = \cos x$.

iii) Because $f(x) = e^x$ differentiates to $f'(x) = e^x$, it follows that $f^{(k)}(x) = e^x$
for every $k \ge 0$, proving the remarkable fact that all higher-order derivatives of
the exponential function are equal to $e^x$.
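The formula $f^{(k)}(x) = \frac{n!}{(n-k)!}x^{n-k}$ of item i) can be verified mechanically by differentiating a polynomial stored as a list of coefficients. This is a small sketch under stated assumptions: the representation (`coeffs[i]` is the coefficient of $x^i$) and the function names are choices made here for illustration only.

```python
from math import factorial

def differentiate(coeffs):
    # coeffs[i] is the coefficient of x**i; the derivative of c*x**i is i*c*x**(i-1)
    return [i * c for i, c in enumerate(coeffs)][1:]

def kth_derivative_of_power(n, k):
    # Start from x**n and differentiate k times
    coeffs = [0] * n + [1]
    for _ in range(k):
        coeffs = differentiate(coeffs)
    return coeffs

n = 5
for k in range(n + 1):
    derived = kth_derivative_of_power(n, k)
    # Expected: a single monomial n!/(n-k)! * x**(n-k)
    expected = [0] * (n - k) + [factorial(n) // factorial(n - k)]
    print(k, derived, derived == expected)

# One more differentiation gives the zero polynomial (empty coefficient list)
print(kth_derivative_of_power(n, n + 1))
```

The empty list returned for $k = n + 1$ corresponds to the statement that all derivatives of order $k > n$ vanish identically.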


A couple of definitions wrap up the section. 


Definition 6.32 A map $f$ is of class $C^k$ ($k \ge 0$) on an interval $I$ if $f$ is
differentiable $k$ times everywhere on $I$ and its $k$th derivative $f^{(k)}$ is continuous
on $I$. The collection of all $C^k$ maps on $I$ is denoted by $C^k(I)$.
A map $f$ is of class $C^\infty$ on $I$ if it is arbitrarily differentiable everywhere on
$I$. One indicates by $C^\infty(I)$ the collection of such maps.

In virtue of Proposition 6.3, if $f \in C^k(I)$ all derivatives of order less than or
equal to $k$ are continuous on $I$. Similarly, if $f \in C^\infty(I)$, all its derivatives are
continuous on $I$.

Moreover, the elementary functions are differentiable any number of times (so
they are of class $C^\infty$) at every interior point of their domains.




6.9 Convexity and inflection points 


Let $f$ be differentiable at the point $x_0$ of the domain. As customary, we indicate
by $y = t(x) = f(x_0) + f'(x_0)(x - x_0)$ the equation of the tangent to the graph of
$f$ at $x_0$.

Definition 6.33 The map $f$ is convex at $x_0$ if there is a neighbourhood
$I_r(x_0) \subseteq \operatorname{dom} f$ such that

$$\forall x \in I_r(x_0), \qquad f(x) \ge t(x);$$

$f$ is strictly convex if $f(x) > t(x)$, $\forall x \ne x_0$.

The definitions for concave and strictly concave functions are alike (just change
$\ge$, $>$ to $\le$, $<$).

What does this say geometrically? A map is convex at a point if around that 
point the graph lies ‘above’ the tangent line, concave if its graph is ‘below’ the 
tangent (Fig. 6.9). 


Example 6.34
We claim that $f(x) = x^2$ is strictly convex at $x_0 = 1$. The tangent at the given
point has equation

$$t(x) = 1 + 2(x - 1) = 2x - 1.$$

Since $f(x) > t(x)$ means $x^2 > 2x - 1$, hence $x^2 - 2x + 1 = (x - 1)^2 > 0$, $t$ lies
below the graph except at the touching point $x = 1$.


Definition 6.35 A differentiable map f on an interval I is convex on I if 


it is convex at each point of I. 


For understanding convexity, inflection points play a role reminiscent of ex- 
tremum points for the study of monotone functions. 


Figure 6.9. Strictly convex (left) and strictly concave (right) maps at $x_0$


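Definition 6.36 The point $x_0$ is an inflection point for $f$ if there is a
neighbourhood $I_r(x_0) \subseteq \operatorname{dom} f$ where one of the following conditions holds:
either

$$\text{if } x < x_0,\ f(x) \le t(x); \qquad \text{if } x > x_0,\ f(x) \ge t(x), \qquad \forall x \in I_r(x_0),$$

or

$$\text{if } x < x_0,\ f(x) \ge t(x); \qquad \text{if } x > x_0,\ f(x) \le t(x), \qquad \forall x \in I_r(x_0).$$

In the former case we speak of an ascending inflection, in the latter the
inflection is descending.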


In the plane, the graph of f ‘cuts through’ the inflectional tangent at an in- 
flection point (Fig. 6.10). 


The analysis of convexity and inflections of a function is helped a great deal 
by the next results. 


Theorem 6.37 Given a differentiable map $f$ on the interval $I$,

a) if $f$ is convex on $I$, then $f'$ is increasing on $I$.
b1) If $f'$ is increasing on $I$, then $f$ is convex on $I$;
b2) if $f'$ is strictly increasing on $I$, then $f$ is strictly convex on $I$.

Proof. See Appendix A.4.3, p. 455.


Figure 6.10. Ascending (left) and descending (right) inflections at $x_0$


Corollary 6.38 If $f$ is differentiable twice on $I$, then

a) $f$ convex on $I$ implies $f''(x) \ge 0$ for all $x \in I$.
b1) $f''(x) \ge 0$ for all $x \in I$ implies $f$ convex on $I$;
b2) $f''(x) > 0$ for all $x \in I$ implies $f$ strictly convex on $I$.

Proof. This follows directly from Theorem 6.37 by applying Theorem 6.27 to the
function $f'$.

There is a second formulation for this, namely: under the same hypothesis, the
following formulas are true:

$$f''(x) \ge 0, \ \forall x \in I \iff f \text{ is convex on } I$$

and

$$f''(x) > 0, \ \forall x \in I \implies f \text{ is strictly convex on } I.$$

Here, as in the characterisation of monotone functions, the last implication has no
reverse. For instance, $f(x) = x^4$ is strictly convex on $\mathbb{R}$, but has vanishing second
derivative at the origin.

Analogies clearly exist concerning concave functions. 


Corollary 6.39 Let $f$ be twice differentiable around $x_0$.

a) If $x_0$ is an inflection point, then $f''(x_0) = 0$.

b) Assume $f''(x_0) = 0$. If $f''$ changes sign when crossing $x_0$, then $x_0$ is an
inflection point (ascending if $f''(x) \le 0$ at the left of $x_0$ and $f''(x) \ge 0$ at
its right, descending otherwise). If $f''$ does not change sign, $x_0$ is not an
inflection point.

The proof relies on Taylor’s formula, and will be given in Sect. 7.4.

The reader ought to beware that $f''(x_0) = 0$ does not warrant $x_0$ is a point
of inflection for $f$. The function $f(x) = x^4$ has second derivative $f''(x) = 12x^2$
which vanishes at $x_0 = 0$. The origin is nonetheless not an inflection point, for
the tangent at $x_0$ is the axis $y = 0$, and the graph of $f$ stays always above it. In
addition, $f''$ does not change sign around $x_0$.


Example 6.29 (continuation)
For $f(x) = x e^{2x}$ we have $f''(x) = 4(x + 1)e^{2x}$ vanishing at $x_1 = -1$. As $f''(x) > 0$
if and only if $x > -1$, $f$ is strictly concave on $(-\infty, -1)$ and strictly convex on
$(-1, +\infty)$. The point $x_1 = -1$ is an ascending inflection. The graph of $f(x)$ is
shown in Fig. 6.11. □

Figure 6.11. Example 6.29


6.9.1 Extension of the notion of convexity 


The geometrical nature of convex maps manifests itself by considering a generalisation
of the notion given in Sect. 6.9. Recall a subset $C$ of the plane is said
convex if the segment $P_1P_2$ between any two points $P_1, P_2 \in C$ is all contained
in $C$.

Given a function $f : I \subseteq \mathbb{R} \to \mathbb{R}$, we denote by

$$E_f = \{(x, y) \in \mathbb{R}^2 : x \in I,\ y \ge f(x)\}$$

the set of points of the plane lying above the graph of $f$ (as in Fig. 6.12, left).

Definition 6.40 The map $f : I \subseteq \mathbb{R} \to \mathbb{R}$ is called convex on $I$ if the set
$E_f$ is a convex subset of the plane.


It is easy to convince oneself that the convexity of $E_f$ can be checked by
considering points $P_1, P_2$ belonging to the graph of $f$ only. In other words, given
$x_1, x_2$ in $I$, the segment $S_{12}$ between $(x_1, f(x_1))$ and $(x_2, f(x_2))$ should lie above
the graph.
Since one can easily check that any $x$ between $x_1$ and $x_2$ can be represented as

$$x = (1 - t)x_1 + t x_2 \qquad \text{with} \qquad t = \frac{x - x_1}{x_2 - x_1} \in [0, 1],$$

the convexity of $f$ reads

$$f\big((1 - t)x_1 + t x_2\big) \le (1 - t) f(x_1) + t f(x_2) \qquad \forall x_1, x_2 \in I,\ \forall t \in [0, 1].$$

Figure 6.12. The set $E_f$ for a generic $f$ defined on $I$ (left) and for $f(x) = |x|$ (right)


If the inequality is strict for $x_1 \ne x_2$ and $t \in (0, 1)$, the function is called strictly
convex on $I$.

For differentiable functions on the interval $I$, Definitions 6.40, 6.33 can be
proven to be equivalent. But a function may well be convex according to Definition
6.40 without being differentiable on $I$, like $f(x) = |x|$ on $I = \mathbb{R}$ (Fig. 6.12,
right). Note, however, that convexity implies continuity at all interior points of $I$,
although discontinuities may occur at the end-points.
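The inequality $f((1-t)x_1 + t x_2) \le (1-t)f(x_1) + t f(x_2)$ lends itself to a quick random test. The sketch below is only a sanity check, not a proof: the function name `is_convex_on_samples`, the number of trials and the tolerance are assumptions made for this illustration. A returned `False` exhibits a genuine violation, while `True` merely means that no violation was found among the sampled triples.

```python
import random

def is_convex_on_samples(f, a, b, trials=10000, seed=0, tol=1e-12):
    # Test f((1-t)x1 + t x2) <= (1-t) f(x1) + t f(x2) on random x1, x2 in [a, b], t in [0, 1]
    rng = random.Random(seed)
    for _ in range(trials):
        x1 = rng.uniform(a, b)
        x2 = rng.uniform(a, b)
        t = rng.random()
        lhs = f((1 - t) * x1 + t * x2)
        rhs = (1 - t) * f(x1) + t * f(x2)
        if lhs > rhs + tol:
            return False  # a violating triple (x1, x2, t) was found
    return True

print(is_convex_on_samples(abs, -5.0, 5.0))                # |x|: convex, not differentiable at 0
print(is_convex_on_samples(lambda x: x * x, -5.0, 5.0))    # x^2: convex
print(is_convex_on_samples(lambda x: x ** 3, -2.0, 0.0))   # x^3 on [-2, 0]: concave there
```

Note that the test never uses derivatives, which is exactly why Definition 6.40 applies to $f(x) = |x|$.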


6.10 Qualitative study of a function 


We have hitherto supplied the reader with several analytical tools to study a 
map f on its domain and draw a relatively thorough — qualitatively speaking — 
graph. This section describes a step-by-step procedure for putting together all the 
information acquired. 


Domain and symmetries 
It should be possible to determine the domain of a generic function starting from 
the elementary functions that build it via algebraic operations and composition. 
The study is greatly simplified if one detects the map’s possible symmetries and 
periodicity at the very beginning (see Sect. 2.6). For instance, an even or odd map 
can be studied only for positive values of the variable. We point out that a function 
might present different kinds of symmetries, like the symmetry with respect to a 
vertical line other than the y-axis: the graph of $f(x) = e^{-|x-2|}$ is symmetric with
respect to $x = 2$ (Fig. 6.13).

For the same reason the behaviour of a periodic function is captured by its 
restriction to an interval as wide as the period. 


Behaviour at the end-points of the domain 

Assuming the domain is a union of intervals, as often happens, one should find the 
one-sided limits at the end-points of each interval. Then the existence of asymp- 
totes should be discussed, as in Sect. 5.3. 

For instance, consider

$$f(x) = \frac{\log(2 - x)}{\sqrt{x^2 - 2x}}.$$

Figure 6.13. The function $f(x) = e^{-|x-2|}$

Now, $\log(2 - x)$ is defined for $2 - x > 0$, or $x < 2$; in addition, $\sqrt{x^2 - 2x}$ has
domain $x^2 - 2x \ge 0$, so $x \le 0$ or $x \ge 2$, and being a denominator, $x \ne 0, 2$.
Thus $\operatorname{dom} f = (-\infty, 0)$. Since $\lim_{x \to 0^-} f(x) = +\infty$, the line $x = 0$ is a vertical left
asymptote, while

$$\lim_{x \to -\infty} f(x) = \lim_{x \to -\infty} \frac{\log(2 - x)}{|x|} = 0$$

yields the horizontal left asymptote $y = 0$.


Monotonicity and extrema 

The first step consists in computing the derivative $f'$ and its domain $\operatorname{dom} f'$. Even
if the derivative’s analytical expression might be defined on a larger interval, one
should in any case have $\operatorname{dom} f' \subseteq \operatorname{dom} f$. For example $f(x) = \log x$ has $f'(x) = \frac{1}{x}$
and $\operatorname{dom} f = \operatorname{dom} f' = (0, +\infty)$, even though $g(x) = \frac{1}{x}$ makes sense for any $x \ne 0$.
After that, the zeroes and sign of $f'$ should be determined. They allow one to find the
intervals where $f$ is monotone and to discuss the nature of critical points (the zeroes
of $f'$), in the light of Sect. 6.7.

A careless analysis might result in wrong conclusions. Suppose a map $f$ is
differentiable on the union $(a, b) \cup (b, c)$ of two bordering intervals where $f' > 0$.
If $f$ is not differentiable at the point $b$, deducing from that that $f$ is increasing
on $(a, b) \cup (b, c)$ is wrong. The function $f(x) = -\frac{1}{x}$ satisfies $f'(x) = \frac{1}{x^2} > 0$ on
$(-\infty, 0) \cup (0, +\infty)$, but it is not globally increasing therein (e.g. $f(-1) > f(1)$);
we can only say $f$ is increasing on $(-\infty, 0)$ and on $(0, +\infty)$ separately.

Recall that extremum points need not only be critical points. The function
$f(x) = \sqrt{\frac{x}{1 + x^2}}$, defined on $x \ge 0$, has a critical point $x = 1$ giving an absolute
maximum. At the other extremum $x = 0$, the function is not differentiable,
although $f(0)$ is the absolute minimum.


Convexity and inflection points 

Along the same lines one determines the intervals upon which the function is 
convex or concave, and its inflections. As in Sect. 6.9, we use the second derivative 
for this. 


Sign of the function and its higher derivatives 
When sketching the graph of $f$ we might find it useful (not compulsory) to establish
the sign of $f$ and its vanishing points (the $x$-coordinates of the intersections of the
graph with the horizontal axis). The roots of $f(x) = 0$ are not always easy to find
analytically. In such cases one may resort to the Theorem of existence of zeroes
4.23, and deduce the presence of a unique zero within a certain interval. Likewise
can be done for the sign of the first or second derivatives.

The function $f(x) = x \log x - 1$ is defined for $x > 0$. One has $f(x) < 0$ when
$x \le 1$. On $x \ge 1$ the map is strictly increasing (in fact $f'(x) = \log x + 1 > 0$ for
$x > 1/e$); besides, $f(1) = -1 < 0$ and $f(e) = e - 1 > 0$. Therefore there is exactly
one zero somewhere in $(1, e)$, $f$ is negative to the left of said zero and positive to
the right.
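Once the Theorem of existence of zeroes has confined the zero of $f(x) = x\log x - 1$ to $(1, e)$, it can be approximated by repeatedly halving the interval. The bisection routine below is a standard technique brought in here for illustration, not a method described in the text; the names `bisect` and `zero` and the tolerance are assumptions of this sketch.

```python
import math

def f(x):
    # f(x) = x log x - 1, defined for x > 0
    return x * math.log(x) - 1

def bisect(g, a, b, tol=1e-12):
    # Assumes g is continuous on [a, b] with g(a) and g(b) of opposite sign
    ga, gb = g(a), g(b)
    if ga * gb > 0:
        raise ValueError("g(a) and g(b) must have opposite signs")
    while b - a > tol:
        m = (a + b) / 2
        gm = g(m)
        if gm == 0:
            return m
        if ga * gm < 0:
            b = m            # the sign change lies in [a, m]
        else:
            a, ga = m, gm    # the sign change lies in [m, b]
    return (a + b) / 2

zero = bisect(f, 1.0, math.e)
print(zero, f(zero))
```

Uniqueness of the zero is what makes the result meaningful: since $f$ is strictly increasing on $x \ge 1$, the interval cannot hide a second sign change.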


6.10.1 Hyperbolic functions 


An exemplary application of what we have seen so far is the study of a family of functions,
called hyperbolic, that show up in various concrete situations.
We introduce the maps $f(x) = \sinh x$ and $g(x) = \cosh x$ by

$$\sinh x = \frac{e^x - e^{-x}}{2} \qquad \text{and} \qquad \cosh x = \frac{e^x + e^{-x}}{2}.$$

They are respectively called hyperbolic sine and hyperbolic cosine. The terminology
stems from the fundamental relation

$$\cosh^2 x - \sinh^2 x = 1, \qquad \forall x \in \mathbb{R},$$

whence the point $P$ of coordinates $(X, Y) = (\cosh x, \sinh x)$ runs along the right
branch of the rectangular hyperbola $X^2 - Y^2 = 1$ as $x$ varies.

The first observation is that $\operatorname{dom} f = \operatorname{dom} g = \mathbb{R}$; moreover, $f(x) = -f(-x)$
and $g(x) = g(-x)$, hence the hyperbolic sine is an odd map, whereas the hyperbolic
cosine is even. Concerning the limit behaviour,

$$\lim_{x \to \pm\infty} \sinh x = \pm\infty, \qquad \lim_{x \to \pm\infty} \cosh x = +\infty.$$

This implies that there are neither vertical nor horizontal asymptotes. No oblique
asymptotes exist either, because these functions behave like exponentials for $x \to \pm\infty$.
More precisely

$$\sinh x \sim \pm\frac{1}{2} e^{|x|}, \qquad \cosh x \sim \frac{1}{2} e^{|x|}, \qquad x \to \pm\infty.$$

It is clear that $\sinh x = 0$ if and only if $x = 0$, $\sinh x > 0$ when $x > 0$, while
$\cosh x > 0$ everywhere on $\mathbb{R}$. The monotonic features follow easily from

$$D \sinh x = \cosh x \qquad \text{and} \qquad D \cosh x = \sinh x, \qquad \forall x \in \mathbb{R}.$$

Thus the hyperbolic sine is increasing on the entire $\mathbb{R}$. The hyperbolic cosine is
strictly increasing on $[0, +\infty)$ and strictly decreasing on $(-\infty, 0]$, has an absolute
minimum $\cosh 0 = 1$ at $x = 0$ (so $\cosh x \ge 1$ on $\mathbb{R}$).


Figure 6.14. Hyperbolic sine (left) and hyperbolic cosine (right)

Differentiating once more gives

$$D^2 \sinh x = \sinh x \qquad \text{and} \qquad D^2 \cosh x = \cosh x, \qquad \forall x \in \mathbb{R},$$

which says that the hyperbolic sine is strictly convex on $(0, +\infty)$ and strictly
concave on $(-\infty, 0)$. The origin is an ascending inflection point. The hyperbolic
cosine is strictly convex on the whole $\mathbb{R}$. The graphs are drawn in Fig. 6.14.

In analogy to the ordinary trigonometric functions, there is a hyperbolic
tangent defined as

$$\tanh x = \frac{\sinh x}{\cosh x} = \frac{e^{2x} - 1}{e^{2x} + 1}.$$

Its domain is $\mathbb{R}$, it is odd, strictly increasing and ranges over the open interval
$(-1, 1)$ (Fig. 6.15).

The inverse map to the hyperbolic sine, appropriately called inverse hyperbolic
sine, is defined on all of $\mathbb{R}$, and can be made explicit by means of the
logarithm (inverse of the exponential)

$$\sinh^{-1} x = \log\left(x + \sqrt{x^2 + 1}\right), \qquad x \in \mathbb{R}. \qquad (6.15)$$


Figure 6.15. Hyperbolic tangent

There normally is no confusion with the reciprocal $1/\sinh x$, whence the use of
notation¹. The inverse hyperbolic cosine is obtained by inversion of the hyperbolic
cosine restricted to $[0, +\infty)$

$$\cosh^{-1} x = \log\left(x + \sqrt{x^2 - 1}\right), \qquad x \in [1, +\infty). \qquad (6.16)$$

To conclude, the inverse hyperbolic tangent inverts the corresponding hyperbolic
map on $\mathbb{R}$

$$\tanh^{-1} x = \frac{1}{2} \log \frac{1 + x}{1 - x}, \qquad x \in (-1, 1). \qquad (6.17)$$


The inverse hyperbolic functions have first derivatives

$$D \sinh^{-1} x = \frac{1}{\sqrt{x^2 + 1}}, \qquad D \cosh^{-1} x = \frac{1}{\sqrt{x^2 - 1}}, \qquad D \tanh^{-1} x = \frac{1}{1 - x^2}.$$
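The logarithmic expressions (6.15), (6.16), (6.17) can be compared with the inverse hyperbolic functions of Python's standard `math` module, and the derivative of $\sinh^{-1}$ with a central difference quotient. This is a numerical sketch only; the function names ending in `_log`, the sample points and the step `h` are choices made for this illustration.

```python
import math

def asinh_log(x):
    # (6.15): valid for every real x
    return math.log(x + math.sqrt(x * x + 1))

def acosh_log(x):
    # (6.16): valid for x >= 1
    return math.log(x + math.sqrt(x * x - 1))

def atanh_log(x):
    # (6.17): valid for -1 < x < 1
    return 0.5 * math.log((1 + x) / (1 - x))

for x in (-3.0, -0.5, 0.0, 2.0):
    print(x, asinh_log(x), math.asinh(x))
for x in (1.0, 1.5, 10.0):
    print(x, acosh_log(x), math.acosh(x))
for x in (-0.9, 0.0, 0.3):
    print(x, atanh_log(x), math.atanh(x))

# Central difference quotient for D asinh at x = 2, against 1/sqrt(x^2 + 1)
h = 1e-6
numeric = (asinh_log(2 + h) - asinh_log(2 - h)) / (2 * h)
exact = 1 / math.sqrt(2 ** 2 + 1)
print(numeric, exact)
```

For large negative arguments the expression $x + \sqrt{x^2+1}$ suffers from cancellation in floating point, so the formula (6.15) is best regarded as an identity rather than as a numerical recipe.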


6.11 The Theorem of de l’Hôpital


This final section is entirely devoted to a single result, due to its relevance in computing
the limits of indeterminate forms. Its proof can be found in Appendix A.4.2,
p. 452. As always, $c$ is one of $x_0$, $x_0^+$, $x_0^-$, $+\infty$, $-\infty$.


Theorem 6.41 Let $f, g$ be maps defined on a neighbourhood of $c$, except
possibly at $c$, and such that

$$\lim_{x \to c} f(x) = \lim_{x \to c} g(x) = L,$$

where $L = 0, +\infty$ or $-\infty$. If $f$ and $g$ are differentiable around $c$, except
possibly at $c$, with $g' \ne 0$, and if

$$\lim_{x \to c} \frac{f'(x)}{g'(x)}$$

exists (finite or not), then also

$$\lim_{x \to c} \frac{f(x)}{g(x)}$$

exists and equals the previous limit.


¹ Some authors also like the symbol Arcsinh.


Under said hypotheses the result states that

$$\lim_{x \to c} \frac{f(x)}{g(x)} = \lim_{x \to c} \frac{f'(x)}{g'(x)}. \qquad (6.20)$$


Examples 6.42
i) The limit

$$\lim_{x \to 0} \frac{e^{2x} - e^{-2x}}{\sin 5x}$$

gives rise to an indeterminate form of type $\frac{0}{0}$. Since numerator and denominator
are differentiable functions,

$$\lim_{x \to 0} \frac{2e^{2x} + 2e^{-2x}}{5\cos 5x} = \frac{4}{5}.$$

Therefore

$$\lim_{x \to 0} \frac{e^{2x} - e^{-2x}}{\sin 5x} = \frac{4}{5}.$$

ii) When the ratio $f'(x)/g'(x)$ is still an indeterminate form, supposing $f$ and $g$
are twice differentiable around $c$, except maybe at $c$, we can iterate the recipe of
(6.20) by studying the limit of $f''(x)/g''(x)$, and so on.
Consider for instance the indeterminate form $0/0$

$$\lim_{x \to 0} \frac{1 + 3x - \sqrt{(1 + 2x)^3}}{x \sin x}.$$

Differentiating numerator and denominator, we are led to

$$\lim_{x \to 0} \frac{3 - 3\sqrt{1 + 2x}}{\sin x + x\cos x},$$

still of the form $0/0$. Thus we differentiate again

$$\lim_{x \to 0} \frac{-\dfrac{3}{\sqrt{1 + 2x}}}{2\cos x - x\sin x} = -\frac{3}{2}.$$

Applying (6.20) twice allows us to conclude

$$\lim_{x \to 0} \frac{1 + 3x - \sqrt{(1 + 2x)^3}}{x \sin x} = -\frac{3}{2}.$$
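The two limits of Examples 6.42 can be cross-checked by evaluating the original ratios at points approaching 0. The sketch below is a plausibility check only (floating-point evaluation of a $0/0$ form loses accuracy when $x$ is too small); the names `ratio1` and `ratio2` and the chosen sample points are assumptions of this illustration.

```python
import math

def ratio1(x):
    # (e^{2x} - e^{-2x}) / sin 5x, expected to approach 4/5 as x -> 0
    return (math.exp(2 * x) - math.exp(-2 * x)) / math.sin(5 * x)

def ratio2(x):
    # (1 + 3x - sqrt((1 + 2x)^3)) / (x sin x), expected to approach -3/2 as x -> 0
    return (1 + 3 * x - math.sqrt((1 + 2 * x) ** 3)) / (x * math.sin(x))

for h in (1e-1, 1e-2, 1e-3):
    print(h, ratio1(h), ratio2(h))
```

The printed values drift towards $\frac{4}{5}$ and $-\frac{3}{2}$ respectively, in agreement with the limits obtained through (6.20).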

Remark 6.43 De l’Hôpital’s Theorem is a sufficient condition only, for the existence
of (6.19). Otherwise said, it might happen that the limit of the quotient of the
derivatives does not exist, whereas the limit of the quotient of the functions does.
For example, set $f(x) = x + \sin x$ and $g(x) = 2x + \cos x$. While
the ratio $f'/g'$ does not admit limit as $x \to +\infty$ (see Remark 4.19), the limit of
$f/g$ exists:

$$\lim_{x \to +\infty} \frac{x + \sin x}{2x + \cos x} = \lim_{x \to +\infty} \frac{x + o(x)}{2x + o(x)} = \frac{1}{2}.$$


6.11.1 Applications of de l’Hôpital’s theorem
We survey some situations where the result of de l’Hôpital lends a helping hand.


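Fundamental limits
By means of Theorem 6.41 we recover the important limits

$$\lim_{x \to +\infty} \frac{e^x}{x^\alpha} = +\infty, \qquad \lim_{x \to -\infty} |x|^\alpha e^x = 0, \qquad \forall \alpha \in \mathbb{R}, \qquad (6.21)$$

$$\lim_{x \to +\infty} \frac{\log x}{x^\alpha} = 0, \qquad \lim_{x \to 0^+} x^\alpha \log x = 0, \qquad \forall \alpha > 0. \qquad (6.22)$$

These were presented in (5.6) in the equivalent formulation of the Landau symbols.
Let us begin with the first of (6.21) when $\alpha = 1$. From (6.20)

$$\lim_{x \to +\infty} \frac{e^x}{x} = \lim_{x \to +\infty} \frac{e^x}{1} = +\infty.$$

For any other $\alpha > 0$, we have

$$\lim_{x \to +\infty} \frac{e^x}{x^\alpha} = \lim_{x \to +\infty} \left(\frac{e^{x/\alpha}}{x}\right)^{\alpha} = \frac{1}{\alpha^\alpha} \left(\lim_{y \to +\infty} \frac{e^y}{y}\right)^{\alpha} = +\infty.$$

At last, for $\alpha \le 0$ the result is rather trivial because there is no indeterminacy. As
for the second formula of (6.21)

$$\lim_{x \to -\infty} |x|^\alpha e^x = \lim_{x \to -\infty} \frac{|x|^\alpha}{e^{-x}} = \lim_{x \to -\infty} \frac{|x|^\alpha}{e^{|x|}} = \lim_{y \to +\infty} \frac{y^\alpha}{e^y} = 0.$$

Now to (6.22):

$$\lim_{x \to +\infty} \frac{\log x}{x^\alpha} = \lim_{x \to +\infty} \frac{\frac{1}{x}}{\alpha x^{\alpha - 1}} = \frac{1}{\alpha} \lim_{x \to +\infty} \frac{1}{x^\alpha} = 0$$

and

$$\lim_{x \to 0^+} x^\alpha \log x = \lim_{x \to 0^+} \frac{\log x}{x^{-\alpha}} = \lim_{x \to 0^+} \frac{\frac{1}{x}}{(-\alpha) x^{-\alpha - 1}} = -\frac{1}{\alpha} \lim_{x \to 0^+} x^\alpha = 0.$$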

Proof of Theorem 6.15

We are now in a position to prove this earlier claim.

Proof. By definition only,

$$f'(x_0) = \lim_{x \to x_0} \frac{f(x) - f(x_0)}{x - x_0};$$

but this is an indeterminate form, since

$$\lim_{x \to x_0} \big(f(x) - f(x_0)\big) = \lim_{x \to x_0} (x - x_0) = 0,$$

hence de l’Hôpital implies

$$f'(x_0) = \lim_{x \to x_0} \frac{f'(x)}{1}.$$


Computing the order of magnitude of a map
Through examples we explain how de l’Hôpital’s result detects the order of magnitude
of infinitesimal or infinite functions, and their principal parts.
The function

$$f(x) = e^x - 1 - \sin x$$

is infinitesimal for $x \to 0$. With infinitesimal test function $\varphi(x) = x$ we apply the
theorem twice (supposing for a moment this is possible)

$$\lim_{x \to 0} \frac{e^x - 1 - \sin x}{x^\alpha} = \lim_{x \to 0} \frac{e^x - \cos x}{\alpha x^{\alpha - 1}} = \lim_{x \to 0} \frac{e^x + \sin x}{\alpha(\alpha - 1)x^{\alpha - 2}}.$$

When $\alpha = 2$ the right-most limit exists and is in fact $\frac{1}{2}$. This fact alone justifies the
use of de l’Hôpital’s Theorem. Thus $f(x)$ is infinitesimal of order 2 at the origin
with respect to $\varphi(x) = x$; its principal part is $p(x) = \frac{1}{2}x^2$.
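Next, consider

$$f(x) = \tan x,$$

an infinite function for $x \to \frac{\pi}{2}^-$. Setting $\varphi(x) = \dfrac{1}{\frac{\pi}{2} - x}$, we have

$$\lim_{x \to \frac{\pi}{2}^-} \frac{\tan x}{\varphi^\alpha(x)} = \lim_{x \to \frac{\pi}{2}^-} \sin x \cdot \lim_{x \to \frac{\pi}{2}^-} \frac{\left(\frac{\pi}{2} - x\right)^\alpha}{\cos x}.$$

While the first limit is 1, for the second we apply de l’Hôpital’s Theorem

$$\lim_{x \to \frac{\pi}{2}^-} \frac{\left(\frac{\pi}{2} - x\right)^\alpha}{\cos x} = \lim_{x \to \frac{\pi}{2}^-} \frac{-\alpha\left(\frac{\pi}{2} - x\right)^{\alpha - 1}}{-\sin x}.$$

The latter equals 1 when $\alpha = 1$, so $\tan x$ is infinite of first order, for $x \to \frac{\pi}{2}^-$, with
respect to $\varphi(x) = \dfrac{1}{\frac{\pi}{2} - x}$. The principal part is indeed $\varphi(x)$.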
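The order of an infinitesimal can also be estimated numerically: if $f(x) \sim c\,x^\alpha$ for $x \to 0^+$, then $\log f(h_1) - \log f(h_2) \approx \alpha(\log h_1 - \log h_2)$ for small $h_1, h_2$. The sketch below applies this heuristic, which is not part of the text, to $f(x) = e^x - 1 - \sin x$; the function name `estimated_order` and the two sample points are assumptions made here.

```python
import math

def f(x):
    # Infinitesimal for x -> 0, studied above
    return math.exp(x) - 1 - math.sin(x)

def estimated_order(g, h1, h2):
    # Slope of log g against log x between two small positive points
    return math.log(g(h1) / g(h2)) / math.log(h1 / h2)

order = estimated_order(f, 1e-2, 1e-3)
coefficient = f(1e-3) / (1e-3) ** 2   # compare with the principal part x^2 / 2

print(order, coefficient)
```

The estimated slope is close to 2 and the ratio $f(x)/x^2$ is close to $\frac{1}{2}$, matching the order and the principal part $p(x) = \frac{1}{2}x^2$ found with de l’Hôpital’s Theorem.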


6.12 Exercises 


1. Discuss differentiability at the point x9 indicated: 
a) f(z) =2+|2£-1], 


tq =1 'b)| f(@) =sinlel, to =0 
= e-1/s? c £0 = 
[o)| F@) ‘° a; ) XO 




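2. Say where the following maps are differentiable and find the derivatives:

a) $f(x) = x|x|$
b) $f(x) = \cos|x|$
c) $f(x) = \begin{cases} x^2 + 1 & \text{if } x \ge 0, \\ e^x - x & \text{if } x < 0 \end{cases}$
d) $f(x) = \begin{cases} x^2 + x - 5 & \text{if } x \ge 1, \\ x - 4 & \text{if } x < 1 \end{cases}$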
3. Compute, where defined, the first derivative of:
a) $f(x) = 3x\sqrt[3]{1 + x^2}$
b) $f(x) = \log|\sin x|$
c) $f(x) = \cos\left(e^{x^2 + 1}\right)$
d) $f(x) = \dfrac{1}{x \log x}$


4. On the given interval, find maximum and minimum of:

a) $f(x) = \sin x + \cos x$, $[0, 2\pi]$
b) $f(x) = x^2 - |x + 1| - 2$, $[-2, 1]$


5. Write the equation of the tangent at xo to the graph of the following maps: 
f(w) =log(3e-2), my =2 db) f(@)= 5, 


xT 1 1 
joe", ro = 0 d) F(x) = sin —, ro = — 


6. Verify that $f(x) = 5x + x^3 + 2x^5$ is invertible on $\mathbb{R}$, $f^{-1}$ is differentiable on
the same set, and compute $(f^{-1})'(0)$ and $(f^{-1})'(8)$.


Pq = 1 


7. Prove that $f(x) = (x - 1)e^{x^2} + \arctan(\log x) + 2$ is invertible on its domain
and find the range.


8. Verify that $f(x) = \log(2 + x) + 2\,\dfrac{x + 1}{x + 2}$ has no zeroes apart from $x_0 = -1$.


9. Determine the number of zeroes and critical points of

$$f(x) = \frac{x \log x - 1}{x^2}.$$

10. Discuss relative and absolute minima of the map

$$f(x) = 2\sin x + \frac{1}{2}\cos 2x$$

on $[0, 2\pi]$.




Find the largest interval containing to = 4 on which the function 


$$f(x) = \log x - \frac{1}{\log x}$$


has an inverse, which is also explicitly required. Calculate the derivative of the 
inverse at the origin. 


12. Verify that

$$\log(1 + x) \le x, \qquad \forall x > -1.$$

13. Sketch a graph for $f(x) = 3x^5 - 50x^3 + 135x$. Then find the largest and smallest
possible numbers of real roots of $f(x) + k$, as $k$ varies in the reals.

14. Consider $f(x) = x^4 - 2\sqrt{\log x}$ and
a) find its domain;
b) discuss monotonicity;
c) prove the point $(e^4 - 2, e)$ belongs to the graph of $f^{-1}$, then compute the
derivative of $f^{-1}$ at $e^4 - 2$.


15. Regarding

$$f(x) = \frac{\sqrt{x^2 - 3}}{x + 1},$$

a) find domain, limits at the domain’s boundary and possible asymptotes;
b) study the intervals of monotonicity, the maximum and minimum points,
specifying which are relative, which absolute;
c) sketch a graph;
d) define

$$g(x) = \begin{cases} f(x + \sqrt{3}) & \text{if } x \ge 0, \\ f(x - \sqrt{3}) & \text{if } x < 0. \end{cases}$$

Relying on the results found for $f$ draw a picture of $g$, and study its
continuity and differentiability at the origin.


Given 


f(z) = v |x? — 4| — 2, 


a) find domain, limits at the domain’s boundary and asymptotes; 
b) determine the sign of f; 

c) study the intervals of monotonicity and list the extrema; 

d) detect the points of discontinuity and of non-differentiability; 
e) sketch the graph of f. 


Consider 
f(z) = Ver —1. 




a) What does f(x) do at the boundary of the domain? 
b) Where is f monotone, where not differentiable? 

c) Discuss convexity and find the inflection points. 

d) Sketch a graph. 


Let 


—j_—eclel 42 
f(z) =1-e cae 


be given. 

a) Find domain and asymptotes, if any; 

b) discuss differentiability and monotonic properties; 

c) determine maxima, minima, saying whether global or local; 
d) sketch the graph. 


Given 


f(x) = e*(x* — 8|a — 3] — 8), 


determine 

a) the monotonicity; 

b) the relative extrema and range im f; 

c) the points where f is not continuous, or not differentiable; 
d) a rough graph; 

e) whether there is a real a such that 


g(x) = f(a) — ala — 3| 


is of class C! over the whole real line. 


20. Given

$$f(x) = \frac{\log|1 + x|}{(1 + x)^2}$$

find
a) domain, behaviour at the boundary, asymptotes,
b) monotonicity intervals, relative or absolute maxima and minima,
c) convexity and inflection points,
d) and sketch a graph.


21. Let

$$f(x) = \frac{x \log|x|}{1 + \log^2|x|}.$$

a) Prove $f$ can be prolonged with continuity to $\mathbb{R}$ and discuss the differentiability
of the prolongation $g$;
b) determine the number of stationary points $g$ has;
c) draw a picture for $g$ that takes monotonicity and asymptotes into account.


22. Determine for

$$f(x) = \arctan\frac{|x| + 3}{x - 3}$$

a) domain, limits at the boundary, asymptotes;
b) monotonicity, relative and absolute extremum points, $\inf f$ and $\sup f$;
c) differentiability;
d) concavity and convexity;
e) a graph that highlights the previous features.


23. Consider the map

$$f(x) = \arcsin\sqrt{2e^x - e^{2x}}$$

and say
a) what are the domain, the boundary limits, the asymptotes of $f(x)$;
b) at which points the function is differentiable;
c) where $f$ is monotone, where it reaches a maximum or a minimum;
d) what the graph of $f(x)$ looks like, using the information so far collected.
e) Define a map $\tilde{f}$ continuously prolonging $f$ to the entire $\mathbb{R}$.


6.12.1 Solutions 


1. Differentiability:

a) Not differentiable.
b) The right and left limits of the difference quotient, for $x \to 0$, are:

$$\lim_{x \to 0^+} \frac{\sin x - 0}{x - 0} = 1, \qquad \lim_{x \to 0^-} \frac{\sin(-x) - 0}{x - 0} = -1.$$

Consequently, the function is not differentiable at $x_0 = 0$.
c) For $x \ne 0$ the map is differentiable and

$$f'(x) = \frac{2}{x^3}\, e^{-1/x^2}.$$

Moreover $\lim_{x \to 0} f(x) = \lim_{x \to 0} f'(x) = 0$, so $f$ is continuous at $x_0 = 0$. By
Theorem 6.15, it is also differentiable at that point.
d) Not differentiable.


2. Differentiability:

a) Because

$$f(x) = \begin{cases} x^2 & \text{if } x \ge 0, \\ -x^2 & \text{if } x < 0, \end{cases}$$

$f$ is certainly differentiable at $x \ne 0$ with

$$f'(x) = \begin{cases} 2x & \text{if } x > 0, \\ -2x & \text{if } x < 0. \end{cases}$$

The map is continuous on $\mathbb{R}$ (composites and products preserve continuity),
hence in particular also at $x = 0$. Furthermore, $\lim_{x \to 0^+} f'(x) = \lim_{x \to 0^-} f'(x) = 0$,
making $f$ differentiable at $x = 0$, with $f'(0) = 0$.
b) Differentiable on $\mathbb{R}$, $f'(x) = -\sin x$.
c) Differentiable everywhere, $f'(x) = \begin{cases} 2x & \text{if } x \ge 0, \\ e^x - 1 & \text{if } x < 0. \end{cases}$
d) The map is clearly continuous for $x \ne 1$; but also at $x = 1$, since

$$\lim_{x \to 1^+} (x^2 + x - 5) = f(1) = -3 = \lim_{x \to 1^-} (x - 4).$$

The derivative is

$$f'(x) = \begin{cases} 2x + 1 & \text{if } x > 1, \\ 1 & \text{if } x < 1, \end{cases}$$

so $f$ is differentiable at least on $\mathbb{R} \setminus \{1\}$. Using Theorem 6.15 on the right- and
left-hand derivatives independently, gives

$$f'_+(1) = \lim_{x \to 1^+} f'(x) = 3, \qquad f'_-(1) = \lim_{x \to 1^-} f'(x) = 1.$$

At the point $x = 1$, a corner, the function is not differentiable.


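3. Derivatives:

a) $f'(x) = \dfrac{5x^2 + 3}{\sqrt[3]{(1 + x^2)^2}}$
b) $f'(x) = \operatorname{cotan} x$
c) $f'(x) = -2x\, e^{x^2 + 1} \sin e^{x^2 + 1}$
d) $f'(x) = -\dfrac{\log x + 1}{x^2 \log^2 x}$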


4. Maxima and minima: 
Both functions are continuous so the existence of maxima and minima is guaran- 
teed by Weierstrass’s theorem. 


a) Maximum value $\sqrt{2}$ at the point $x = \frac{\pi}{4}$; minimum $-\sqrt{2}$ at $x = \frac{5}{4}\pi$. (The
interval’s end-points are relative minimum and maximum points, not absolute.)
b) One has

$$f(x) = \begin{cases} x^2 + x - 1 & \text{if } x < -1, \\ x^2 - x - 3 & \text{if } x \ge -1. \end{cases}$$

The function coincides with the parabola $y = (x + \frac{1}{2})^2 - \frac{5}{4}$ for $x < -1$. The
latter has vertex in $(-\frac{1}{2}, -\frac{5}{4})$ and is convex, so on the interval $[-2, -1]$ of
concern it decreases; its maximum is 1 at $x = -2$ and minimum $-1$ at $x = -1$.
For $x \ge -1$, we have the convex parabola $y = (x - \frac{1}{2})^2 - \frac{13}{4}$ with vertex $(\frac{1}{2}, -\frac{13}{4})$.
Thus on $[-1, 1]$, there is a minimum point $x = \frac{1}{2}$ with image $f(\frac{1}{2}) = -\frac{13}{4}$.
Besides, $f(-1) = -1$ and $f(1) = -3$, so the maximum $-1$ is reached at $x = -1$.
In conclusion, $f$ has minimum $-\frac{13}{4}$ (for $x = \frac{1}{2}$) and maximum 1 (at $x = -2$);
see Fig. 6.16.

Figure 6.16. Graph of $f(x) = x^2 - |x + 1| - 2$
5. Tangent lines: 
a) Since

$$f'(x) = \frac{3}{3x - 2}, \qquad f(2) = \log 4, \qquad f'(2) = \frac{3}{4},$$

the equation of the tangent is

$$y = f(2) + f'(2)(x - 2) = \log 4 + \frac{3}{4}(x - 2).$$

b) y=3 
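c) As

$$f'(x) = \frac{e^{\sqrt{2x + 1}}}{\sqrt{2x + 1}}, \qquad f(0) = f'(0) = e,$$

the tangent has equation $y = e + ex$.
d) $y = \pi^2\left(x - \frac{1}{\pi}\right)$.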


6. As sum of strictly increasing elementary functions on $\mathbb{R}$, so is our function.
Therefore invertibility follows. By continuity and because $\lim_{x \to \pm\infty} f(x) = \pm\infty$,
Corollary 4.30 implies $\operatorname{im} f = \mathbb{R}$. The function is differentiable on the real line,
$f'(x) = 5 + 3x^2 + 10x^4 > 0$ for all $x \in \mathbb{R}$; Theorem 6.9 tells that $f^{-1}$ is differentiable
on $\mathbb{R}$. Eventually $f(0) = 0$ and $f(1) = 8$, so

$$(f^{-1})'(0) = \frac{1}{f'(0)} = \frac{1}{5} \qquad \text{and} \qquad (f^{-1})'(8) = \frac{1}{f'(1)} = \frac{1}{18}.$$
7. On the domain $(0, +\infty)$ the map is strictly increasing (as sum of strictly increasing
maps), hence invertible. Monotonicity follows also from the positivity of

$$f'(x) = (2x^2 - 2x + 1)e^{x^2} + \frac{1}{x(1 + \log^2 x)}.$$

In addition, $f$ is continuous, so Corollary 4.30 ensures that the range is an interval
bounded by $\inf f$ and $\sup f$:

$$\inf f = \lim_{x \to 0^+} f(x) = -1 - \frac{\pi}{2} + 2 = 1 - \frac{\pi}{2}, \qquad \sup f = \lim_{x \to +\infty} f(x) = +\infty.$$

Therefore $\operatorname{im} f = (1 - \frac{\pi}{2}, +\infty)$.


8. The map is defined only for $x > -2$, and continuous, strictly increasing on the
whole domain as

$$f'(x) = \frac{1}{x + 2} + \frac{2}{(x + 2)^2} > 0, \qquad \forall x > -2.$$

Therefore $f(x) < f(-1) = 0$ for $x < -1$ and $f(x) > f(-1) = 0$ for $x > -1$.


9. The domain is $x > 0$. The zeroes solve

$$x \log x - 1 = 0 \qquad \text{i.e.} \qquad \log x = \frac{1}{x}.$$

If we set $h(x) = \log x$ and $g(x) = \frac{1}{x}$, then

$$h(1) = 0 < 1 = g(1) \qquad \text{and} \qquad h(e) = 1 > \frac{1}{e} = g(e);$$

Corollary 4.27 says there is an $x_0 \in (1, e)$ such that $h(x_0) = g(x_0)$. Such a point
has to be unique because $h$ is strictly increasing and $g$ strictly decreasing. Thus $f$
has only one vanishing point, confined inside $(1, e)$.

For the critical points, we compute the first derivative:

$$f'(x) = \frac{x^2(\log x + 1) - 2x(x \log x - 1)}{x^4} = \frac{x + 2 - x \log x}{x^3}.$$

The zeroes of $f'$ are then the roots of

$$x + 2 - x \log x = 0 \qquad \text{i.e.} \qquad \log x = \frac{2 + x}{x}.$$

Let $g(x) = \frac{2 + x}{x} = 1 + \frac{2}{x}$, whence

$$h(e) = 1 < 1 + \frac{2}{e} = g(e) \qquad \text{and} \qquad h(e^2) = 2 > 1 + \frac{2}{e^2} = g(e^2);$$

again, Corollary 4.27 indicates a unique $\bar{x}_0 \in (e, e^2)$ with $h(\bar{x}_0) = g(\bar{x}_0)$ (uniqueness
follows from the monotonicity of $h$ and $g$). In conclusion, $f$ has precisely one
critical point, lying in $(e, e^2)$.


10. In virtue of the duplication formulas (2.13),

$$f'(x) = 2\cos x - \sin 2x = 2\cos x\,(1 - \sin x).$$

Thus $f'(x) = 0$ when $x = \frac{\pi}{2}$ and $x = \frac{3}{2}\pi$, $f'(x) > 0$ for $0 < x < \frac{\pi}{2}$ or $\frac{3}{2}\pi < x < 2\pi$.
This says $x = \frac{\pi}{2}$ is an absolute maximum point, where $f(\frac{\pi}{2}) = \frac{3}{2}$, while $x = \frac{3}{2}\pi$
gives an absolute minimum $f(\frac{3}{2}\pi) = -\frac{5}{2}$. Additionally, $f(0) = f(2\pi) = \frac{1}{2}$ so the
boundary points of $[0, 2\pi]$ are extrema: more precisely, $x = 0$ is a minimum point and
$x = 2\pi$ a maximum point.


11. Since $f$ is defined on $x > 0$ with $x \ne 1$, the maximal interval containing
$x_0$ where $f$ is invertible must be a subset of $(0, 1)$. On the latter, we study the
monotonicity, or equivalently the invertibility, of $f$ which is, remember, continuous
everywhere on the domain. Since

$$f'(x) = \frac{1}{x} + \frac{1}{x \log^2 x} = \frac{\log^2 x + 1}{x \log^2 x},$$

it is immediate to see $f'(x) > 0$ for any $x \in (0, 1)$, meaning $f$ is strictly increasing
on $(0, 1)$. Therefore the largest interval of invertibility is indeed $(0, 1)$.
To write the inverse explicitly, put $t = \log x$ so that

$$y = t - \frac{1}{t}, \qquad t^2 - ty - 1 = 0, \qquad t = \frac{y \pm \sqrt{y^2 + 4}}{2},$$

and changing variable back to $x$ (with the minus sign, because $t = \log x < 0$ on $(0, 1)$),

$$x = f^{-1}(y) = e^{\frac{y - \sqrt{y^2 + 4}}{2}},$$

or, in the more customary notation,

$$y = f^{-1}(x) = e^{\frac{x - \sqrt{x^2 + 4}}{2}}.$$

Eventually $f^{-1}(0) = e^{-1}$, so

$$(f^{-1})'(0) = \frac{1}{f'(e^{-1})} = \frac{1}{2e}.$$


12. The function $f(x) = \log(1 + x) - x$ is defined on $x > -1$, and

$$\lim_{x \to -1^+} f(x) = -\infty, \qquad \lim_{x \to +\infty} f(x) = \lim_{x \to +\infty} \big(-x + o(x)\big) = -\infty.$$

As

$$f'(x) = \frac{1}{1 + x} - 1 = -\frac{x}{1 + x},$$

$x = 0$ is critical, plus $f'(x) > 0$ on $x < 0$ and $f'(x) < 0$ for $x > 0$. Thus
$f$ increases on $(-1, 0]$ and decreases on $[0, +\infty)$; $x = 0$ is the point where the
absolute maximum $f(0) = 0$ is reached. In conclusion $f(x) \le f(0) = 0$, for all
$x > -1$.


13. One checks f is odd, plus 


f'(a) = 15e* — 1502? + 135 = 15(2* — 1027 + 9) 
= 15(x? — 1)(2? — 9) = 15(a + 1)(@ — 1)(a + 3)(a — 3). 


The sign of f’ is summarised in the diagram: 


What this tells is that f is increasing on (—oo, —3], [—1,1] and [3,+00), while 
decreasing on [—3, —1] and [1,3]. The points x = —1, x = 3 are relative minima, 
x =1 and x = —3 relative maxima: 


Figure 6.17. The function f(x) = 32° — 502° + 1352 


6.12 Exercises 213 
f(1) = —f(-1) = 88 and f (3) = —f(-3) = —216. 


Besides, 
jim f(z) =—-o, lim f(x) = +00. 


w—++00 
The graph of f is in Fig. 6.17. 


The second problem posed is equivalent to studying the number of solutions of 
f(x) = —k as k varies: this is the number of intersections between the graph of f 
and the line y = —k. Indeed, 


if $k > 216$ or $k < -216$: one solution

if $k = \pm 216$: two solutions

if $k \in (-216,-88) \cup (88,216)$: three solutions

if $k = \pm 88$: four solutions

if $k \in (-88,88)$: five solutions.

This gives the maximum (5) and minimum (1) number of roots of the polynomial
$3x^5 - 50x^3 + 135x + k$.
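The case distinction can be reproduced mechanically from the monotonicity intervals. The sketch below is an illustration only (the helper `count_roots` is a name introduced here): it counts the solutions of $f(x) = -k$ by checking the critical points $\pm1$, $\pm3$ and then each interval on which $f$ is strictly monotone.

```python
import math

def f(x):
    return 3 * x**5 - 50 * x**3 + 135 * x

def count_roots(k):
    """Number of real roots of 3x^5 - 50x^3 + 135x + k, i.e. solutions of f(x) = -k."""
    c = -k
    crit = [-3, -1, 1, 3]                          # critical points of f
    vals = [-math.inf] + [f(x) for x in crit] + [math.inf]
    # roots sitting exactly at a critical point
    count = sum(1 for x in crit if f(x) == c)
    # f is strictly monotone between consecutive critical points, so each open
    # interval carries one root when c lies strictly between its end values
    for lo, hi in zip(vals, vals[1:]):
        if min(lo, hi) < c < max(lo, hi):
            count += 1
    return count

for k in (300, 216, 150, 88, 0):
    print(k, count_roots(k))
```

Because the critical values $\pm 88$, $\pm 216$ are integers, the comparison `f(x) == c` is exact for integer `k`; for non-integer `k` only the strict inequalities matter.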

14. Study of the function $f(x) = x^4 - 2\sqrt{\log x}$:

a) Necessarily $x > 0$ and $\log x \ge 0$, i.e., $x \ge 1$, so dom $f = [1,+\infty)$.

b) From

$$ f'(x) = \frac{4x^4\sqrt{\log x} - 1}{x\sqrt{\log x}} $$

we have

$$ f'(x) = 0 \iff 4x^4\sqrt{\log x} = 1 \iff g_1(x) = \log x = \frac{1}{16x^8} = g_2(x). $$

On $x \ge 1$ there is an intersection $x_0$ between the graphs of $g_1$, $g_2$ (Fig. 6.18).
Hence $f'(x) > 0$ for $x > x_0$, $f$ is decreasing on $[1,x_0]$, increasing on $[x_0,+\infty)$.


Figure 6.18. Graphs of $g_1(x) = \log x$ and $g_2(x) = \frac{1}{16x^8}$


This makes $x_0$ a minimum point, and monotonicity gives $f$ invertible on $[1,x_0]$
and $[x_0,+\infty)$. In addition, $g_1(1) = \log 1 = 0 < \frac{1}{16} = g_2(1)$ and $g_1(2) = \log 2 >
\frac{1}{2^{12}} = g_2(2)$, which implies $1 < x_0 < 2$.
As $f(e) = e^4 - 2$, the point $(e^4 - 2, e)$ belongs to the graph of $f^{-1}$ and

$$ (f^{-1})'(e^4 - 2) = \frac{1}{f'(e)} = \frac{e}{4e^4 - 1}. $$
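The solution only locates $x_0$ inside $(1,2)$. As a hedged illustration (bisection is a technique brought in here, it is not used in the text), the zero of $g_1 - g_2$ can be approximated as follows; `phi` and `bisect` are names chosen for this sketch.

```python
import math

def phi(x):
    # phi(x) = g1(x) - g2(x) = log x - 1/(16 x^8); its zero is the critical point x0
    return math.log(x) - 1.0 / (16.0 * x**8)

def bisect(func, a, b, tol=1e-12):
    # plain bisection; assumes func(a) and func(b) have opposite signs
    fa = func(a)
    while b - a > tol:
        m = 0.5 * (a + b)
        fm = func(m)
        if fa * fm <= 0:
            b = m
        else:
            a, fa = m, fm
    return 0.5 * (a + b)

x0 = bisect(phi, 1.0, 2.0)
print(x0)
```

The starting interval $[1,2]$ is exactly the one justified by $g_1(1) < g_2(1)$ and $g_1(2) > g_2(2)$.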


15. Study of $f(x) = \frac{\sqrt{x^2-3}}{x+1}$:

a) The domain is determined by $x^2 - 3 \ge 0$ together with $x \neq -1$, hence dom $f =
(-\infty,-\sqrt3] \cup [\sqrt3,+\infty)$. At the boundary points:

$$ \lim_{x\to\pm\infty} f(x) = \lim_{x\to\pm\infty} \frac{|x|\sqrt{1-\frac{3}{x^2}}}{x\left(1+\frac{1}{x}\right)} = \lim_{x\to\pm\infty}\frac{|x|}{x} = \pm 1, $$

$$ \lim_{x\to-\sqrt3^-} f(x) = \lim_{x\to\sqrt3^+} f(x) = 0, $$

so $y = 1$ is the horizontal right asymptote, $y = -1$ the horizontal left asymptote.

b) The derivative

$$ f'(x) = \frac{x+3}{(x+1)^2\sqrt{x^2-3}} $$

vanishes at $x = -3$ and is positive for $x \in (-3,-\sqrt3) \cup (\sqrt3,+\infty)$. Thus $f$
is increasing on $[-3,-\sqrt3]$ and $[\sqrt3,+\infty)$, decreasing on $(-\infty,-3]$; $x = -3$
is an absolute minimum with $f(-3) = -\frac{\sqrt6}{2} < -1$. Furthermore, the points
$x = \pm\sqrt3$ are extrema too, namely $x = -\sqrt3$ is a relative maximum, $x = \sqrt3$
a relative minimum: $f(\pm\sqrt3) = 0$.

c) Fig. 6.19 (left) shows the graph of $f$.

Figure 6.19. Graphs of $f$ (left) and $g$ (right) of Exercise 15

d) Right-translating the negative branch of $f$ by $\sqrt3$ gives $g(x)$ for $x < 0$, whereas
shifting to the left the branch on $x > 0$ gives the positive part of $g$. The final
result is shown in Fig. 6.19 (right).

The map $g$ is continuous on $\mathbb{R}$ (in particular at $x = 0$, because $f(\pm\sqrt3) = 0$). As

$$ \lim_{x\to0^\pm} g'(x) = \lim_{x\to-\sqrt3^-} f'(x) = \lim_{x\to\sqrt3^+} f'(x) = +\infty, $$

$g$ is not differentiable at $x = 0$.

16. Study of $f(x) = \sqrt{|x^2-4|} - x$:
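The domain is $\mathbb{R}$ and

$$ \lim_{x\to-\infty} f(x) = +\infty, \qquad \lim_{x\to+\infty} f(x) = \lim_{x\to+\infty} \frac{x^2-4-x^2}{\sqrt{x^2-4}+x} = 0. $$

Thus $y = 0$ is a horizontal right asymptote. Let us search for oblique asymptotic
directions. As

$$ \lim_{x\to-\infty} \frac{f(x)}{x} = \lim_{x\to-\infty} \left(-\sqrt{1-\frac{4}{x^2}} - 1\right) = -2, $$

$$ \lim_{x\to-\infty} \big(f(x)+2x\big) = \lim_{x\to-\infty} \big(\sqrt{x^2-4}+x\big) = \lim_{x\to-\infty} \frac{x^2-4-x^2}{\sqrt{x^2-4}-x} = 0, $$

the line $y = -2x$ is an oblique left asymptote.

It suffices to solve $\sqrt{|x^2-4|} - x \ge 0$. First, $\sqrt{|x^2-4|} \ge x$ for any $x < 0$. When
$x \ge 0$, we distinguish two cases: $x^2 - 4 < 0$ (so $0 \le x < 2$) and $x^2 - 4 \ge 0$ (i.e.,
$x \ge 2$).
On $0 \le x < 2$, squaring gives

$$ 4 - x^2 \ge x^2 \iff x^2 - 2 \le 0 \iff 0 \le x \le \sqrt2. $$

For $x \ge 2$, squaring implies $x^2 - 4 \ge x^2$, which holds nowhere. The function
then vanishes only at $x = \sqrt2$, is positive on $x < \sqrt2$ and strictly negative for
$x > \sqrt2$.

Since

$$ f(x) = \begin{cases} \sqrt{4-x^2} - x & \text{if } -2 < x < 2, \\ \sqrt{x^2-4} - x & \text{if } x \le -2,\ x \ge 2, \end{cases} $$

we have

$$ f'(x) = \begin{cases} \dfrac{-x}{\sqrt{4-x^2}} - 1 & \text{if } -2 < x < 2, \\[2mm] \dfrac{x}{\sqrt{x^2-4}} - 1 & \text{if } x < -2,\ x > 2. \end{cases} $$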


When $-2 < x < 2$, $f'(x) \ge 0$ if $x + \sqrt{4-x^2} \le 0$, that is $\sqrt{4-x^2} \le -x$. The
inequality does not hold for $x \ge 0$; on $-2 < x < 0$ we square, so that

$$ 4 - x^2 \le x^2 \iff x^2 - 2 \ge 0 \iff -2 < x \le -\sqrt2. $$

Hence $f'(x) = 0$ for $x = -\sqrt2$, $f'(x) > 0$ for $-2 < x < -\sqrt2$ and $f'(x) < 0$
when $-\sqrt2 < x < 2$.

If $x < -2$ or $x > 2$, $f'(x) > 0$ if $x - \sqrt{x^2-4} > 0$, i.e., $\sqrt{x^2-4} < x$. The latter
is never true for $x < -2$; for $x > 2$, $x^2 > x^2 - 4$ is always true. Therefore
$f'(x) > 0$ for $x > 2$ and $f'(x) < 0$ for $x < -2$.

Summary: $f$ decreases on $(-\infty,-2]$ and $[-\sqrt2,2]$, increases on $[-2,-\sqrt2]$ and
$[2,+\infty)$. The points $x = \pm2$ are relative minima, $x = -\sqrt2$ a relative
maximum. The corresponding values are $f(-2) = 2$, $f(2) = -2$, $f(-\sqrt2) = 2\sqrt2$,
so $x = 2$ is actually a global minimum.

As composite of continuous elementary maps, $f$ is continuous on its domain.
To study differentiability it is enough to examine $f'$ for $x \to \pm2$. Because

$$ \lim_{x\to\pm2} f'(x) = \infty, $$

at $x = \pm2$ there is no differentiability.
The graph is shown in Fig. 6.20.

Figure 6.20. The function $f(x) = \sqrt{|x^2-4|} - x$


17. Study of $f(x) = \sqrt[3]{e^{2x}-1}$:

a) The function is defined everywhere, and

$$ \lim_{x\to+\infty} f(x) = +\infty \quad\text{and}\quad \lim_{x\to-\infty} f(x) = -1. $$

b) The first derivative

$$ f'(x) = \frac{2}{3}\,\frac{e^{2x}}{\sqrt[3]{(e^{2x}-1)^2}} $$

is positive for $x \in \mathbb{R}\setminus\{0\}$, and $f$ is not differentiable at $x = 0$, for
$\lim_{x\to0} f'(x) = +\infty$. Therefore $f$ increases everywhere on $\mathbb{R}$.

c) The second derivative (for $x \neq 0$)

$$ f''(x) = \frac{4}{9}\,e^{2x}\,\frac{e^{2x}-3}{(e^{2x}-1)^{5/3}} $$

vanishes at $x = \frac12\log3$; it is positive when $x \in (-\infty,0) \cup (\frac12\log3,+\infty)$. This
makes $x = \frac12\log3$ an ascending inflection, plus $f$ convex on $(-\infty,0]$ and
$[\frac12\log3,+\infty)$, concave on $[0,\frac12\log3]$. Suitably extending the definition, the
point $x = 0$ may be acknowledged as an inflection (with vertical tangent).

d) See Fig. 6.21.

Figure 6.21. The map $f(x) = \sqrt[3]{e^{2x}-1}$

18. Study of $f(x) = 1 - e^{-|x|} + \frac{x}{e}$:


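a) Clearly dom $f = \mathbb{R}$. As

$$ \lim_{x\to\pm\infty} e^{-|x|} = 0, $$

we immediately obtain

$$ \lim_{x\to\pm\infty} f(x) = \pm\infty, $$

$$ \lim_{x\to\pm\infty} \frac{f(x)}{x} = \lim_{x\to\pm\infty} \left(\frac{1}{x} - \frac{e^{-|x|}}{x} + \frac{1}{e}\right) = \frac{1}{e}, $$

$$ \lim_{x\to\pm\infty} \left(f(x) - \frac{x}{e}\right) = \lim_{x\to\pm\infty} \big(1 - e^{-|x|}\big) = 1. $$

This makes $y = \frac{x}{e} + 1$ a complete oblique asymptote.

b) The map is continuous on $\mathbb{R}$, and certainly differentiable for $x \neq 0$. As

$$ f'(x) = \begin{cases} -e^{x} + \dfrac{1}{e} & \text{if } x < 0, \\[2mm] e^{-x} + \dfrac{1}{e} & \text{if } x > 0, \end{cases} $$

it follows

$$ \lim_{x\to0^-} f'(x) = -1 + \frac{1}{e} \neq \lim_{x\to0^+} f'(x) = 1 + \frac{1}{e}, $$

preventing differentiability at $x = 0$.
Moreover, for $x > 0$ we have $f'(x) > 0$. On $x < 0$, $f'(x) > 0$ if $e^x < \frac{1}{e}$,
i.e., $x < -1$. The map is increasing on $(-\infty,-1]$ and $[0,+\infty)$, decreasing on
$[-1,0]$.

c) The previous considerations imply $x = -1$ is a local maximum with $f(-1) =
1 - \frac{2}{e}$, $x = 0$ a local minimum where $f(0) = 0$.

d) See Fig. 6.22.

Figure 6.22. Graph of $f(x) = 1 - e^{-|x|} + \frac{x}{e}$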
19. Study of $f(x) = e^x(x^2 - 8|x-3| - 8)$:

a) The domain covers $\mathbb{R}$. Since

$$ f(x) = \begin{cases} e^x(x^2+8x-32) & \text{if } x < 3, \\ e^x(x^2-8x+16) & \text{if } x \ge 3, \end{cases} $$

we have

$$ f'(x) = \begin{cases} e^x(x^2+10x-24) & \text{if } x < 3, \\ e^x(x^2-6x+8) & \text{if } x > 3. \end{cases} $$

On $x < 3$: $f'(x) = 0$ if $x^2+10x-24 = 0$, so $x = -12$ or $x = 2$, while $f'(x) > 0$
if $x \in (-\infty,-12) \cup (2,3)$. On $x > 3$: $f'(x) = 0$ if $x^2-6x+8 = 0$, i.e., $x = 4$
($x = 2$ is a root, but lying outside the interval $x > 3$ we are considering), while
$f'(x) > 0$ if $x \in (4,+\infty)$.
Therefore $f$ is increasing on the intervals $(-\infty,-12]$, $[2,3]$ and $[4,+\infty)$,
decreasing on $[-12,2]$ and $[3,4]$.

b) From part a) we know $x = -12$ and $x = 3$ are relative maxima, $x = 2$ and $x = 4$
relative minima: $f(-12) = 16e^{-12}$, $f(2) = -12e^2$, $f(3) = e^3$ and $f(4) = 0$. For
the range, let us determine

$$ \lim_{x\to-\infty} f(x) = \lim_{x\to-\infty} e^x(x^2+8x-32) = 0, $$

$$ \lim_{x\to+\infty} f(x) = \lim_{x\to+\infty} e^x(x^2-8x+16) = +\infty. $$

Continuity implies

$$ \operatorname{im} f = [\min f(x), \sup f(x)) = [f(2),+\infty) = [-12e^2,+\infty). $$

c) No discontinuities are present, for the map is the composite of continuous
functions. As for the differentiability, the only unclear point is $x = 3$. But

$$ \lim_{x\to3^-} f'(x) = \lim_{x\to3^-} e^x(x^2+10x-24) = 15e^3, $$

$$ \lim_{x\to3^+} f'(x) = \lim_{x\to3^+} e^x(x^2-6x+8) = -e^3, $$

so $f$ is not differentiable at $x = 3$.

d) See Fig. 6.23; a neighbourhood of $x = -12$ is magnified.

Figure 6.23. Graph of $f(x) = e^x(x^2 - 8|x-3| - 8)$


e) The function $g$ is continuous on the real axis and

$$ g'(x) = \begin{cases} e^x(x^2+10x-24) + \alpha & \text{if } x < 3, \\ e^x(x^2-6x+8) - \alpha & \text{if } x > 3. \end{cases} $$

In order for $g$ to be differentiable at $x = 3$, we must have

$$ \lim_{x\to3^-} g'(x) = 15e^3 + \alpha = \lim_{x\to3^+} g'(x) = -e^3 - \alpha; $$

the value $\alpha = -8e^3$ makes $g$ of class $C^1$ on the whole real line.

20. Study of $f(x) = \frac{\log|x+1|}{(x+1)^2}$:
a) dom $f = \mathbb{R}\setminus\{-1\}$. By (5.6) c)

$$ \lim_{x\to\pm\infty} f(x) = 0, $$

while

$$ \lim_{x\to-1^\pm} f(x) = \frac{-\infty}{0^+} = -\infty. $$

From this, $x = -1$ is a vertical asymptote, and $y = 0$ is a complete horizontal
asymptote.

b) The derivative

$$ f'(x) = \frac{1 - 2\log|x+1|}{(x+1)^3} $$

tells that $f(x)$ is differentiable on the domain; $f'(x) = 0$ if $|x+1| = \sqrt{e}$, hence
for $x = -1 \pm \sqrt{e}$; $f'(x) > 0$ if $x \in (-\infty,-\sqrt{e}-1) \cup (-1,\sqrt{e}-1)$. All this says $f$
increases on $(-\infty,-\sqrt{e}-1]$ and $(-1,-1+\sqrt{e}]$, decreases on $[-\sqrt{e}-1,-1)$ and
$[-1+\sqrt{e},+\infty)$, has (absolute) maxima at $x = -1 \pm \sqrt{e}$, with $f(-1 \pm \sqrt{e}) = \frac{1}{2e}$.

c) From

$$ f''(x) = \frac{-5 + 6\log|x+1|}{(x+1)^4} $$

the second derivative is defined at each point of dom $f$, and vanishes at $|x+1| =
e^{5/6}$, so at $x = -1 \pm e^{5/6}$. Since $f''(x) > 0$ on $x \in (-\infty,-1-e^{5/6}) \cup (e^{5/6}-1,+\infty)$,
$f$ is convex on $(-\infty,-1-e^{5/6}]$ and $[e^{5/6}-1,+\infty)$, while it is concave on
$[-1-e^{5/6},-1)$ and $(-1,e^{5/6}-1]$. The points $x = -1 \pm e^{5/6}$ are inflections.

d) See Fig. 6.24.


Figure 6.24. Graph of $f(x) = \frac{\log|x+1|}{(1+x)^2}$

21. Study of $f(x) = \frac{x\log|x|}{1+\log^2|x|}$:

a) The domain is clear: dom $f = \mathbb{R}\setminus\{0\}$. Since $\lim_{x\to0} f(x) = 0$ ($x$ 'wins' against the
logarithm) we can extend $f$ to $\mathbb{R}$ with continuity, by defining $g(0) = 0$. The
function is odd, so we shall restrict the study to $x > 0$.

As far as the differentiability is concerned, when $x > 0$

$$ f'(x) = \frac{\log^3 x - \log^2 x + \log x + 1}{(1+\log^2 x)^2}; $$

with $t = \log x$, the limit reads

$$ \lim_{x\to0^+} f'(x) = \lim_{t\to-\infty} \frac{t^3 - t^2 + t + 1}{(1+t^2)^2} = 0. $$

Therefore the map $g$, prolongation of $f$, is not only continuous but also
differentiable, due to Theorem 6.15, on the entire $\mathbb{R}$. In particular $g'(0) = 0$.

Part a) is also telling that $x = 0$ is stationary for $g$. To find other critical
points, we look at the zeroes of the map $h(t) = t^3 - t^2 + t + 1$, where $t = \log x$
($x > 0$). Since

$$ \lim_{t\to-\infty} h(t) = -\infty, \qquad \lim_{t\to+\infty} h(t) = +\infty, $$

$$ h(0) = 1, \qquad h'(t) = 3t^2 - 2t + 1 > 0, \quad \forall t \in \mathbb{R}, $$

$h$ is always increasing and has one negative zero $t_0$. Its graph is represented in
Fig. 6.25 (left).
As $t_0 = \log x_0 < 0$, $0 < x_0 = e^{t_0} < 1$. But the function is odd, so $g$ has two
more stationary points, $x_0$ and $-x_0$ respectively.
By the previous part $g'(x) > 0$ on $(x_0,+\infty)$ and $g'(x) < 0$ on $(0,x_0)$. To
summarise then, $g$ (odd) is increasing on $(-\infty,-x_0]$ and $[x_0,+\infty)$, decreasing
on $[-x_0,x_0]$. Because

$$ \lim_{x\to+\infty} g(x) = +\infty $$

and

$$ \lim_{x\to+\infty} \frac{g(x)}{x} = \lim_{x\to+\infty} \frac{\log x}{1+\log^2 x} = \lim_{t\to+\infty} \frac{t}{1+t^2} = 0, $$

there are no asymptotes.
For the graph see Fig. 6.25 (right).

Figure 6.25. The functions $h$ (left) and $g$ (right) of Exercise 21


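22. Study of $f(x) = \arctan\frac{|x|+3}{x-3}$:

a) dom $f = \mathbb{R}\setminus\{3\}$. The function is more explicitly given by

$$ f(x) = \begin{cases} \arctan\dfrac{-x+3}{x-3} = \arctan(-1) = -\dfrac{\pi}{4} & \text{if } x \le 0, \\[2mm] \arctan\dfrac{x+3}{x-3} & \text{if } x > 0, \end{cases} $$

whence

$$ \lim_{x\to-\infty} f(x) = -\frac{\pi}{4}, \qquad \lim_{x\to+\infty} f(x) = \arctan 1 = \frac{\pi}{4}, $$

$$ \lim_{x\to3^-} f(x) = \arctan\frac{6}{0^-} = \arctan(-\infty) = -\frac{\pi}{2}, $$

$$ \lim_{x\to3^+} f(x) = \arctan\frac{6}{0^+} = \arctan(+\infty) = \frac{\pi}{2}. $$

Then the straight lines $y = -\frac{\pi}{4}$, $y = \frac{\pi}{4}$ are horizontal asymptotes (left and
right respectively).

b) The map

$$ f'(x) = \begin{cases} 0 & \text{if } x < 0, \\ -\dfrac{3}{x^2+9} & \text{if } x > 0,\ x \neq 3, \end{cases} $$

is negative on $x > 0$, $x \neq 3$, so $f$ is strictly decreasing on $[0,3)$ and $(3,+\infty)$,
but only non-increasing on $(-\infty,3)$. The reader should take care that $f$ is
not strictly decreasing on the whole $[0,3) \cup (3,+\infty)$ (recall the remarks of
p. 197). The interval $(-\infty,0)$ consists of points of relative non-strict maxima
and minima, for $f(x) = -\frac{\pi}{4}$, whereas $x = 0$ is a relative maximum.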


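Eventually, $\inf f(x) = -\frac{\pi}{2}$, $\sup f(x) = \frac{\pi}{2}$ (the map admits no maximum, nor
minimum).

c) Our map is certainly differentiable on $\mathbb{R}\setminus\{0,3\}$. At $x = 3$, $f$ is not defined; at
$x = 0$, $f$ is continuous but

$$ \lim_{x\to0^-} f'(x) = 0 \neq \lim_{x\to0^+} f'(x) = \lim_{x\to0^+} \left(-\frac{3}{x^2+9}\right) = -\frac{1}{3}, $$

showing that differentiability does not extend beyond $\mathbb{R}\setminus\{0,3\}$.

d) Computing

$$ f''(x) = \begin{cases} 0 & \text{if } x < 0, \\ \dfrac{6x}{(x^2+9)^2} & \text{if } x > 0,\ x \neq 3, \end{cases} $$

reveals that $f''(x) > 0$ for $x > 0$ with $x \neq 3$, so $f$ is convex on $[0,3)$ and
$(3,+\infty)$.

e) See Fig. 6.26.

Figure 6.26. The function $f(x) = \arctan\frac{|x|+3}{x-3}$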


23. Study of $f(x) = \arcsin\sqrt{2e^x - e^{2x}}$:

a) We have to impose $2e^x - e^{2x} \ge 0$ and $-1 \le \sqrt{2e^x - e^{2x}} \le 1$ for the domain; the
first constraint is equivalent to $2 - e^x \ge 0$, hence $x \le \log 2$. Having assumed that
square roots are always positive, the second inequality reduces to $2e^x - e^{2x} \le 1$.
With $y = e^x$, we can write $y^2 - 2y + 1 = (y-1)^2 \ge 0$, which is always true.
Thus dom $f = (-\infty,\log 2]$. Moreover,

$$ \lim_{x\to-\infty} f(x) = 0, \qquad f(\log 2) = 0, $$


ite> 0, wed, 


and y = 0 is a horizontal left asymptote. 




b) From

$$ f'(x) = \frac{e^x(1-e^x)}{\sqrt{e^x(2-e^x)(1-2e^x+e^{2x})}} = \frac{e^x(1-e^x)}{\sqrt{e^x(2-e^x)(1-e^x)^2}} = \begin{cases} -\dfrac{e^x}{\sqrt{e^x(2-e^x)}} & \text{if } 0 < x < \log 2, \\[2mm] \dfrac{e^x}{\sqrt{e^x(2-e^x)}} & \text{if } x < 0, \end{cases} $$

we see that

$$ \lim_{x\to(\log 2)^-} f'(x) = -\infty, \qquad \lim_{x\to0^+} f'(x) = -1, \qquad \lim_{x\to0^-} f'(x) = 1. $$

In this way $f$ is not differentiable at $x = \log 2$, where the tangent is vertical,
and at the corner point $x = 0$.

c) The sign of $f'$ is positive for $x < 0$ and negative for $0 < x < \log 2$, meaning that
$x = 0$ is a global maximum point, $f(0) = \frac{\pi}{2}$, while at $x = \log 2$ the absolute
minimum $f(\log 2) = 0$ is reached; $f$ is monotone on $(-\infty,0]$ (increasing) and
$[0,\log 2]$ (decreasing).

d) See Fig. 6.27. 


e) A possible choice to extend $f$ with continuity is

$$ \tilde f(x) = \begin{cases} f(x) & \text{if } x \le \log 2, \\ 0 & \text{if } x > \log 2. \end{cases} $$

Figure 6.27. The map $f(x) = \arcsin\sqrt{2e^x - e^{2x}}$


7 Taylor expansions and applications


The Taylor expansion of a function around a real point $x_0$ is the representation of
the map as the sum of a polynomial of a certain degree and an infinitesimal function of
order bigger than the degree. It provides an extremely effective tool both from the
qualitative and the quantitative point of view. In a small enough neighbourhood of
$x_0$ one can approximate the function, however complicated, using the polynomial;
the qualitative features of the latter are immediate, and polynomials are easy to
handle. The expansions of the main elementary functions can be aptly combined
to produce more involved expressions, in a way not dissimilar to the algebra of
polynomials.


7.1 Taylor formulas 


We wish to tackle the problem of approximating a function $f$, around a given point
$x_0 \in \mathbb{R}$, by polynomials of increasingly higher degree.

We begin by assuming $f$ is continuous at $x_0$. Introducing the constant polynomial
(degree zero)

$$ Tf_{0,x_0}(x) = f(x_0), \qquad \forall x \in \mathbb{R}, $$

formula (5.4) prompts us to write

$$ f(x) = Tf_{0,x_0}(x) + o(1), \qquad x \to x_0. \tag{7.1} $$

Put in different terms, we may approximate $f$ around $x_0$ using a zero-degree
polynomial, in such a way that the difference $f(x) - Tf_{0,x_0}(x)$ (called error of
approximation, or remainder), is infinitesimal at $x_0$ (Fig. 7.1). The above relation
is the first instance of a Taylor formula.


Suppose now $f$ is not only continuous but also differentiable at $x_0$: then the
first formula of the finite increment (6.11) holds. By defining the polynomial in $x$
of degree one

$$ Tf_{1,x_0}(x) = f(x_0) + f'(x_0)(x - x_0), $$

whose graph is the tangent line to $f$ at $x_0$ (Fig. 7.2), relation (6.11) reads

$$ f(x) = Tf_{1,x_0}(x) + o(x - x_0), \qquad x \to x_0. \tag{7.2} $$

Figure 7.1. Local approximation of $f$ by the polynomial $Tf_0 = Tf_{0,x_0}$


This is another Taylor formula: it says that a differentiable map at $x_0$ can be
locally approximated by a linear function, with an error of approximation that
not only tends to 0 as $x \to x_0$, but is infinitesimal of order bigger than one.

In case $f$ is differentiable in a neighbourhood of $x_0$, except perhaps at $x_0$, the
second formula of the finite increment (6.13) is available: putting $x_1 = x_0$, $x_2 = x$
we write the latter as

$$ f(x) = Tf_{0,x_0}(x) + f'(\bar x)(x - x_0), \tag{7.3} $$

where $\bar x$ denotes a suitable point between $x_0$ and $x$. Compare this with (7.1): now
we have a more accurate expression for the remainder. This allows one to appraise
numerically the accuracy of the approximation, once the increment $x - x_0$ and an
estimate of $f'$ around $x_0$ are known. Formula (7.3) is of Taylor type as well, and
the remainder is called Lagrange's remainder. In (7.1), (7.2) we call it Peano's
remainder, instead.

Figure 7.2. Local approximation of $f$ by the polynomial $Tf_1 = Tf_{1,x_0}$


Now that we have approximated $f$ with polynomials of degrees 0 or 1, as
$x \to x_0$, and made errors $o(1) = o((x-x_0)^0)$ or $o(x-x_0)$ respectively, the natural
question is whether it is possible to approximate the function by a quadratic
polynomial, with an error $o((x-x_0)^2)$ as $x \to x_0$. Equivalently, we seek a real
number $a$ such that

$$ f(x) = f(x_0) + f'(x_0)(x-x_0) + a(x-x_0)^2 + o((x-x_0)^2), \qquad x \to x_0. \tag{7.4} $$

This means

$$ \lim_{x\to x_0} \frac{f(x) - f(x_0) - f'(x_0)(x-x_0) - a(x-x_0)^2}{(x-x_0)^2} = 0. $$

By de l'Hôpital's Theorem, such limit holds if

$$ \lim_{x\to x_0} \frac{f'(x) - f'(x_0) - 2a(x-x_0)}{2(x-x_0)} = 0, $$

i.e.,

$$ \lim_{x\to x_0} \left(\frac{1}{2}\,\frac{f'(x) - f'(x_0)}{x-x_0} - a\right) = 0, $$

or

$$ a = \frac{1}{2} \lim_{x\to x_0} \frac{f'(x) - f'(x_0)}{x-x_0}. $$

We conclude that (7.4) is valid when the right-hand-side limit exists and is finite: in
other words, when $f$ is twice differentiable at $x_0$. If so, the coefficient $a$ is $\frac12 f''(x_0)$.
In this way we have obtained the Taylor formula (with Peano's remainder)

$$ f(x) = Tf_{2,x_0}(x) + o((x-x_0)^2), \qquad x \to x_0, \tag{7.5} $$

where

$$ Tf_{2,x_0}(x) = f(x_0) + f'(x_0)(x-x_0) + \frac{1}{2} f''(x_0)(x-x_0)^2 $$

is the Taylor polynomial of $f$ at $x_0$ with degree 2 (Fig. 7.3).
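The role of the coefficient $a$ can be observed numerically. The sketch below is only an illustration under assumptions made here (the test function $f(x) = \sqrt{x}$, the point $x_0 = 4$ and the name `scaled_error` do not come from the text): the quotient in the limit above tends to 0 with $a = \frac12 f''(x_0)$, whereas with another value of $a$ it tends to $\frac12 f''(x_0) - a \neq 0$.

```python
import math

def scaled_error(a, h, x0=4.0):
    # (f(x) - f(x0) - f'(x0) h - a h^2) / h^2 for f = sqrt and x = x0 + h
    f0 = math.sqrt(x0)
    f1 = 0.5 / math.sqrt(x0)
    return (math.sqrt(x0 + h) - f0 - f1 * h - a * h * h) / (h * h)

half_f2 = -0.125 * 4.0 ** -1.5     # (1/2) f''(4) = -1/64
for h in (1e-1, 1e-2, 1e-3):
    # first column shrinks with h, second column approaches -1/64
    print(h, scaled_error(half_f2, h), scaled_error(0.0, h))
```

Very small increments are avoided on purpose: below roughly $h = 10^{-5}$ rounding errors in the numerator, once divided by $h^2$, would dominate the quotient.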


The recipe just described can be iterated, and leads to polynomial approxim- 
ations of increasing order. The final result is the content of the next theorem. 


Figure 7.3. Local approximation of $f$ by $Tf_2 = Tf_{2,x_0}$

Theorem 7.1 (Taylor formula with Peano's remainder) Let $n \ge 0$ and
$f$ be $n$ times differentiable at $x_0$. Then the Taylor formula holds

$$ f(x) = Tf_{n,x_0}(x) + o((x-x_0)^n), \qquad x \to x_0, \tag{7.6} $$

where

$$ Tf_{n,x_0}(x) = \sum_{k=0}^{n} \frac{1}{k!} f^{(k)}(x_0)(x-x_0)^k = f(x_0) + f'(x_0)(x-x_0) + \dots + \frac{1}{n!} f^{(n)}(x_0)(x-x_0)^n. \tag{7.7} $$

The term $Tf_{n,x_0}(x)$ is the Taylor polynomial of $f$ at $x_0$ of order (or degree)
$n$, while $o((x-x_0)^n)$ as in (7.6) is Peano's remainder of order $n$. The
representation of $f$ given by (7.6) is called Taylor expansion of $f$ at $x_0$ of order $n$,
with remainder in Peano's form.
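Formula (7.7) translates directly into a small routine. This is a minimal sketch, assuming the derivative values $f^{(k)}(x_0)$ are supplied by the caller; the name `taylor_poly` and the choice $f(x) = e^x$, $x_0 = 0$ are illustrative, not taken from the text.

```python
import math

def taylor_poly(derivs, x0, x):
    """Value at x of the polynomial with coefficients derivs[k] / k!.

    derivs[k] is assumed to hold f^(k)(x0) for k = 0, ..., n.
    """
    return sum(d / math.factorial(k) * (x - x0) ** k for k, d in enumerate(derivs))

n = 4
derivs = [1.0] * (n + 1)      # every derivative of e^x equals 1 at x0 = 0

def ratio(h):
    # Peano's remainder divided by (x - x0)^n; it should shrink as h -> 0
    return (math.exp(h) - taylor_poly(derivs, 0.0, h)) / h ** n

for h in (1e-1, 1e-2):
    print(h, ratio(h))
```

The printed quotients decrease roughly like $h/120$, in agreement with a remainder that is $o(h^4)$.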


Under stronger hypotheses on $f$ we may furnish a more precise formula for the
remainder, thus extending (7.3).

Theorem 7.2 (Taylor formula with Lagrange's remainder) Let $n \ge 0$
and $f$ differentiable $n$ times at $x_0$, with continuous $n$th derivative, be given;
suppose $f$ is differentiable $n+1$ times around $x_0$, except possibly at $x_0$. Then
the Taylor formula

$$ f(x) = Tf_{n,x_0}(x) + \frac{1}{(n+1)!} f^{(n+1)}(\bar x)(x-x_0)^{n+1} \tag{7.8} $$

holds, for a suitable $\bar x$ between $x_0$ and $x$.

This remainder is called Lagrange's remainder of order $n$, and (7.8) is the Taylor
expansion of $f$ at $x_0$ of order $n$ with Lagrange's remainder.

Theorems 7.1 and 7.2 are proven in Appendix A.4.4, p. 456. 

An additional form of the remainder of order n in a Taylor formula, called 
integral remainder, will be provided in Theorem 9.44. 


A Taylor expansion centred at the origin ($x_0 = 0$) is sometimes called Maclaurin
expansion. A useful relation to simplify the computation of a Maclaurin
expansion goes as follows.

Property 7.3 The Maclaurin polynomial of an even (respectively, odd) map
involves only even (odd) powers of the independent variable.

Proof. If $f$ is even and $n$ times differentiable around the origin, the claim follows
from (7.7) with $x_0 = 0$, provided we show all derivatives of odd order
vanish at the origin.

Recalling Property 6.12, $f$ even implies $f'$ odd, $f''$ even, $f'''$ odd et cetera.
In general, even-order derivatives $f^{(2k)}$ are even functions, whereas $f^{(2k+1)}$
are odd. But an odd map $g$ must necessarily vanish at the origin (if defined
there), because $x = 0$ in $g(-x) = -g(x)$ gives $g(0) = -g(0)$, whence
$g(0) = 0$.

The argument is the same for $f$ odd.


7.2 Expanding the elementary functions 


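The general results allow us to expand simple elementary functions. Other functions
will be discussed in Sect. 7.3.

The exponential function
Let $f(x) = e^x$. Since all derivatives are identical with $e^x$, we have $f^{(k)}(0) = 1$ for
any $k \ge 0$. Maclaurin's expansion of order $n$ with Peano's remainder is

$$ e^x = 1 + x + \frac{x^2}{2} + \dots + \frac{x^k}{k!} + \dots + \frac{x^n}{n!} + o(x^n) = \sum_{k=0}^{n} \frac{x^k}{k!} + o(x^n). \tag{7.9} $$

Using Lagrange's remainder, we have

$$ e^x = \sum_{k=0}^{n} \frac{x^k}{k!} + \frac{e^{\bar x}}{(n+1)!}\,x^{n+1}, \qquad \text{for a certain } \bar x \text{ between 0 and } x. \tag{7.10} $$

Maclaurin's polynomials for $e^x$ of order $n = 1,2,3,4$ are shown in Fig. 7.4.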


Figure 7.4. Local approximation of $f(x) = e^x$ by $Tf_n = Tf_{n,0}$ for $n = 1,2,3,4$

Remark 7.4 Set $x = 1$ in the previous formula:

$$ e = \sum_{k=0}^{n} \frac{1}{k!} + \frac{e^{\bar x}}{(n+1)!} \qquad (\text{with } 0 < \bar x < 1). $$

For any $n \ge 0$, we obtain an estimate (from below) of the number $e$, namely

$$ e_n = \sum_{k=0}^{n} \frac{1}{k!}; \tag{7.11} $$

because $1 < e^{\bar x} < e < 3$, moreover, the following is an estimate of the error:

$$ \frac{1}{(n+1)!} < e - e_n < \frac{3}{(n+1)!}. $$

In contrast to the sequence $\{a_n = (1 + \frac{1}{n})^n\}$ used to define the constant $e$, the
sequence $\{e_n\}$ converges at the rate of a factorial, hence very rapidly (compare
Tables 7.1 and 3.1). Formula (7.11) gives therefore an excellent numerical
approximation of the number $e$.

The expansion of $f(x) = e^x$ at a generic $x_0$ follows from the fact that $f^{(k)}(x_0) = e^{x_0}$:

$$ e^x = e^{x_0} + e^{x_0}(x-x_0) + e^{x_0}\frac{(x-x_0)^2}{2} + \dots + e^{x_0}\frac{(x-x_0)^n}{n!} + o((x-x_0)^n). $$


  n    e_n
  0    1.0000000000000
  1    2.0000000000000
  2    2.5000000000000
  3    2.6666666666667
  4    2.7083333333333
  5    2.7166666666667
  6    2.7180555555556
  7    2.7182539682540
  8    2.7182787698413
  9    2.7182815255732
 10    2.7182818011464

Table 7.1. Values of the sequence $\{e_n\}$ of (7.11)
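The entries of Table 7.1 and the error estimate of Remark 7.4 can be reproduced with a few lines. This is a sketch, with `e_partial` a name introduced here; the partial sums are computed exactly with rational arithmetic and only converted to floating point for printing.

```python
import math
from fractions import Fraction

def e_partial(n):
    # e_n = sum_{k=0}^{n} 1/k!, computed exactly as a rational number
    return sum(Fraction(1, math.factorial(k)) for k in range(n + 1))

for n in range(11):
    en = e_partial(n)
    lower = 1 / math.factorial(n + 1)      # 1/(n+1)!
    upper = 3 / math.factorial(n + 1)      # 3/(n+1)!
    err = math.e - float(en)
    # last column: does 1/(n+1)! < e - e_n < 3/(n+1)! hold?
    print(n, f"{float(en):.13f}", lower < err < upper)
```

Beyond $n \approx 17$ the error $e - e_n$ falls below double precision, so the comparison with `math.e` is meaningful only for moderate $n$ such as those in the table.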


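The logarithm
The derivatives of the function $f(x) = \log x$ are

$$ f'(x) = \frac{1}{x} = x^{-1}, \qquad f''(x) = (-1)x^{-2}, \qquad f'''(x) = (-1)(-2)x^{-3}, $$

and in general,

$$ f^{(k)}(x) = (-1)^{k-1}(k-1)!\,x^{-k}. $$

Thus for $k \ge 1$,

$$ \frac{f^{(k)}(1)}{k!} = \frac{(-1)^{k-1}}{k}, $$

and the Taylor expansion of order $n$ at $x_0 = 1$ is

$$ \log x = (x-1) - \frac{(x-1)^2}{2} + \dots + (-1)^{n-1}\frac{(x-1)^n}{n} + o((x-1)^n). $$

Let us change the independent variable $x - 1 \to x$, to obtain the Maclaurin
expansion of order $n$ of $\log(1+x)$

$$ \log(1+x) = x - \frac{x^2}{2} + \dots + (-1)^{n-1}\frac{x^n}{n} + o(x^n) = \sum_{k=1}^{n} (-1)^{k-1}\frac{x^k}{k} + o(x^n). \tag{7.13} $$

The Maclaurin polynomials of order $n = 1,2,3,4$ for $y = \log(1+x)$ are represented
in Fig. 7.5.

Figure 7.5. Local approximation of $f(x) = \log(1+x)$ by $Tf_n = Tf_{n,0}$ for $n = 1,2,3,4$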


The trigonometric functions

The function $f(x) = \sin x$ is odd, so by Property 7.3 its Maclaurin expansion
contains just odd powers of $x$. We have $f'(x) = \cos x$, $f'''(x) = -\cos x$ and in general
$f^{(2k+1)}(x) = (-1)^k \cos x$, whence $f^{(2k+1)}(0) = (-1)^k$. Maclaurin's expansion up
to order $n = 2m+2$ reads

$$ \sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \dots + (-1)^m \frac{x^{2m+1}}{(2m+1)!} + o(x^{2m+2}) = \sum_{k=0}^{m} (-1)^k \frac{x^{2k+1}}{(2k+1)!} + o(x^{2m+2}). \tag{7.14} $$

The typical structure of the expansion of an odd map should be noticed.
Maclaurin's polynomial $Tf_{2m+2,0}$ of even order $2m+2$ coincides with the polynomial
$Tf_{2m+1,0}$ of odd degree $2m+1$, for $f^{(2m+2)}(0) = 0$. Stopping at order $2m+1$
would have rendered

$$ \sin x = \sum_{k=0}^{m} (-1)^k \frac{x^{2k+1}}{(2k+1)!} + o(x^{2m+1}), $$

to which (7.14) is preferable, because it contains more information on the
remainder's behaviour when $x \to 0$. Figure 7.6 represents the Maclaurin polynomials
of degree $2m+1$, $0 \le m \le 6$, of the sine.
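The difference between the two forms of the remainder for the sine can be seen numerically. In this sketch (the function name `sin_maclaurin` and the sample points are choices made here), the error of the polynomial of degree $2m+1$ is divided both by $x^{2m+1}$ and by $x^{2m+2}$; both quotients tend to 0, and the second one carries the sharper information.

```python
import math

def sin_maclaurin(x, m):
    # sum_{k=0}^{m} (-1)^k x^(2k+1) / (2k+1)!
    return sum((-1) ** k * x ** (2 * k + 1) / math.factorial(2 * k + 1)
               for k in range(m + 1))

m = 1                                   # polynomial x - x^3/6
for x in (0.4, 0.2, 0.1):
    err = math.sin(x) - sin_maclaurin(x, m)
    # err / x^(2m+1) and err / x^(2m+2): both shrink as x -> 0
    print(x, err / x ** (2 * m + 1), err / x ** (2 * m + 2))
```

For $m = 1$ the error behaves like $x^5/120$, so the two printed quotients decay like $x^2/120$ and $x/120$ respectively.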

As far as the even map $f(x) = \cos x$ is concerned, only even exponents appear.
From $f''(x) = -\cos x$, $f^{(4)}(x) = \cos x$ and $f^{(2k)}(x) = (-1)^k \cos x$, it follows
$f^{(2k)}(0) = (-1)^k$, so Maclaurin's expansion of order $n = 2m+1$ is

$$ \cos x = 1 - \frac{x^2}{2} + \frac{x^4}{4!} - \dots + (-1)^m \frac{x^{2m}}{(2m)!} + o(x^{2m+1}) = \sum_{k=0}^{m} (-1)^k \frac{x^{2k}}{(2k)!} + o(x^{2m+1}). \tag{7.15} $$

The considerations made about the sine apply also here. Maclaurin's polynomials
of order $2m$ ($1 \le m \le 6$) for $y = \cos x$ can be seen in Fig. 7.7.

Figure 7.6. Local approximation of $f(x) = \sin x$ by polynomials $Tf_{2m+1} = Tf_{2m+1,0}$
with $0 \le m \le 6$

Figure 7.7. Local approximation of $f(x) = \cos x$ by $Tf_{2m} = Tf_{2m,0}$ when $1 \le m \le 6$


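Power functions
Consider the family of maps $f(x) = (1+x)^\alpha$ for arbitrary $\alpha \in \mathbb{R}$. We have

$$ f'(x) = \alpha(1+x)^{\alpha-1}, \qquad f''(x) = \alpha(\alpha-1)(1+x)^{\alpha-2}, \qquad f'''(x) = \alpha(\alpha-1)(\alpha-2)(1+x)^{\alpha-3}. $$

From the general relation $f^{(k)}(x) = \alpha(\alpha-1)\cdots(\alpha-k+1)(1+x)^{\alpha-k}$ we get

$$ f(0) = 1, \qquad \frac{f^{(k)}(0)}{k!} = \frac{\alpha(\alpha-1)\cdots(\alpha-k+1)}{k!} \quad \text{for } k \ge 1. $$

At this point it becomes convenient to extend the notion of binomial coefficient
(1.10), and allow $\alpha$ to be any real number by putting, in analogy to (1.11),

$$ \binom{\alpha}{0} = 1, \qquad \binom{\alpha}{k} = \frac{\alpha(\alpha-1)\cdots(\alpha-k+1)}{k!} \quad \text{for } k \ge 1. \tag{7.16} $$

Maclaurin's expansion to order $n$ is thus

$$ (1+x)^\alpha = 1 + \alpha x + \frac{\alpha(\alpha-1)}{2}x^2 + \dots + \binom{\alpha}{n}x^n + o(x^n) = \sum_{k=0}^{n} \binom{\alpha}{k} x^k + o(x^n). \tag{7.17} $$

When $\alpha = -1$,

$$ \binom{-1}{2} = \frac{(-1)(-2)}{2} = 1, \qquad \binom{-1}{3} = \frac{(-1)(-2)(-3)}{3!} = -1, \qquad \dots, \qquad \binom{-1}{k} = (-1)^k, $$

so

$$ \frac{1}{1+x} = 1 - x + x^2 - \dots + (-1)^n x^n + o(x^n). $$

Choosing $\alpha = \frac12$ gives

$$ \binom{1/2}{2} = \frac{\frac12\left(\frac12 - 1\right)}{2} = -\frac{1}{8}, \qquad \binom{1/2}{3} = \frac{\frac12\left(\frac12 - 1\right)\left(\frac12 - 2\right)}{3!} = \frac{1}{16}, $$

and the expansion of $f(x) = \sqrt{1+x}$ arrested to the third order is

$$ \sqrt{1+x} = 1 + \frac{x}{2} - \frac{x^2}{8} + \frac{x^3}{16} + o(x^3). $$

The polynomials of order $n = 1,2,3,4$ are shown in Fig. 7.8.

Figure 7.8. Local approximation of $f(x) = \sqrt{1+x}$ by $Tf_n = Tf_{n,0}$ for $n = 1,2,3,4$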
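The generalised binomial coefficients are easy to compute exactly. The following is a sketch with names chosen here (`gen_binom`, `power_coeffs`); it uses rational arithmetic, so $\alpha$ is assumed to be an integer or a `Fraction`.

```python
from fractions import Fraction

def gen_binom(alpha, k):
    # generalised binomial coefficient alpha(alpha-1)...(alpha-k+1)/k!, equal to 1 for k = 0
    result = Fraction(1)
    for j in range(k):
        result *= (Fraction(alpha) - j)
        result /= (j + 1)
    return result

def power_coeffs(alpha, n):
    # coefficients of 1, x, ..., x^n in the Maclaurin polynomial of (1 + x)^alpha
    return [gen_binom(alpha, k) for k in range(n + 1)]

print(power_coeffs(Fraction(1, 2), 3))   # square root of 1 + x
print(power_coeffs(-1, 4))               # 1 / (1 + x)
```

For a natural number $\alpha$ the routine returns the ordinary binomial coefficients, and all coefficients with $k > \alpha$ vanish, in agreement with the finite binomial formula.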


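For convenience, the following table collects the expansions with Peano's
remainder obtained so far. A more comprehensive list is found on p. 476.

$$ \begin{aligned}
e^x &= 1 + x + \frac{x^2}{2} + \dots + \frac{x^k}{k!} + \dots + \frac{x^n}{n!} + o(x^n) \\
\log(1+x) &= x - \frac{x^2}{2} + \dots + (-1)^{n-1}\frac{x^n}{n} + o(x^n) \\
\sin x &= x - \frac{x^3}{3!} + \frac{x^5}{5!} - \dots + (-1)^m\frac{x^{2m+1}}{(2m+1)!} + o(x^{2m+2}) \\
\cos x &= 1 - \frac{x^2}{2} + \frac{x^4}{4!} - \dots + (-1)^m\frac{x^{2m}}{(2m)!} + o(x^{2m+1}) \\
(1+x)^\alpha &= 1 + \alpha x + \frac{\alpha(\alpha-1)}{2}x^2 + \dots + \binom{\alpha}{n}x^n + o(x^n) \\
\frac{1}{1+x} &= 1 - x + x^2 - \dots + (-1)^n x^n + o(x^n) \\
\sqrt{1+x} &= 1 + \frac{1}{2}x - \frac{1}{8}x^2 + \frac{1}{16}x^3 + o(x^3)
\end{aligned} $$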




7.3 Operations on Taylor expansions 


Consider the situation where a map $f$ has a complicated analytic expression, that
involves several elementary functions; it might not be that simple to find its Taylor
expansion using the definition, because computing derivatives at a point up to a
certain order $n$ is no straightforward task. But with the expansions of the elementary
functions at our disposal, a more convenient strategy may be to start from these
and combine them suitably to arrive at $f$. The techniques are explained in this
section.
This approach is indeed justified by the following result.


Proposition 7.5 Let $f : (a,b) \to \mathbb{R}$ be $n$ times differentiable at $x_0 \in (a,b)$.
If there exists a polynomial $P_n$, of degree $\le n$, such that

$$ f(x) = P_n(x) + o((x-x_0)^n) \quad \text{for } x \to x_0, \tag{7.19} $$

then $P_n$ is the Taylor polynomial $T_n = Tf_{n,x_0}$ of order $n$ for the map $f$ at $x_0$.

Proof. Formula (7.19) is equivalent to

$$ P_n(x) = f(x) + \varphi(x), \quad \text{with } \varphi(x) = o((x-x_0)^n) \text{ for } x \to x_0. $$

On the other hand, Taylor's formula for $f$ at $x_0$ reads

$$ T_n(x) = f(x) + \psi(x), \quad \text{with } \psi(x) = o((x-x_0)^n). $$

Therefore

$$ P_n(x) - T_n(x) = \varphi(x) - \psi(x) = o((x-x_0)^n). \tag{7.20} $$

But the difference $P_n(x) - T_n(x)$ is a polynomial of degree less than or equal
to $n$, hence it may be written as

$$ P_n(x) - T_n(x) = \sum_{k=0}^{n} c_k (x-x_0)^k. $$

The claim is that all coefficients $c_k$ vanish. Suppose, by contradiction, there
are some non-zero $c_k$, and let $m$ be the smallest index between 0 and $n$
such that $c_m \neq 0$. Then

$$ P_n(x) - T_n(x) = \sum_{k=m}^{n} c_k (x-x_0)^k = (x-x_0)^m \Big( c_m + \sum_{k=m+1}^{n} c_k (x-x_0)^{k-m} \Big) $$

by factoring out $(x-x_0)^m$. Dividing by $(x-x_0)^m$, taking the limit for $x \to x_0$
and recalling (7.20), we obtain

$$ 0 = c_m, $$

in contrast with the assumption.


The proposition guarantees that however we arrive at an expression like (7.19) 
(in a mathematically correct way), this must be exactly the Taylor expansion of 
order n for f at xo. 
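Example 7.6
Suppose the function $f(x)$ satisfies

$$ f(x) = 2 - 3(x-2) + (x-2)^2 - \frac{1}{4}(x-2)^3 + o((x-2)^3) \quad \text{for } x \to 2. $$

Then (7.7) implies

$$ f(2) = 2, \qquad f'(2) = -3, \qquad \frac{f''(2)}{2} = 1, \qquad \frac{f'''(2)}{3!} = -\frac{1}{4}, $$

hence

$$ f(2) = 2, \qquad f'(2) = -3, \qquad f''(2) = 2, \qquad f'''(2) = -\frac{3}{2}. $$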


For simplicity we shall assume henceforth $x_0 = 0$. This is always possible by a
change of the variables, $x \to t = x - x_0$.

Let now

$$ f(x) = a_0 + a_1 x + \dots + a_n x^n + o(x^n) = p_n(x) + o(x^n) $$

and

$$ g(x) = b_0 + b_1 x + \dots + b_n x^n + o(x^n) = q_n(x) + o(x^n) $$

be the Maclaurin expansions of the maps $f$ and $g$.


Sums
From (5.5) a), it follows

$$ f(x) \pm g(x) = [p_n(x) + o(x^n)] \pm [q_n(x) + o(x^n)] = [p_n(x) \pm q_n(x)] + [o(x^n) \pm o(x^n)] = p_n(x) \pm q_n(x) + o(x^n). $$

The expansion of a sum is the sum of the expansions involved.


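Example 7.7

Let us find the expansions at the origin of the hyperbolic sine and cosine,
introduced in Sect. 6.10.1. Changing $x$ to $-x$ in

$$ e^x = 1 + x + \frac{x^2}{2} + \dots + \frac{x^{2n+1}}{(2n+1)!} + \frac{x^{2n+2}}{(2n+2)!} + o(x^{2n+2}) $$

gives

$$ e^{-x} = 1 - x + \frac{x^2}{2} - \dots - \frac{x^{2n+1}}{(2n+1)!} + \frac{x^{2n+2}}{(2n+2)!} + o(x^{2n+2}). $$

Thus

$$ \sinh x = \frac{1}{2}(e^x - e^{-x}) = x + \frac{x^3}{3!} + \frac{x^5}{5!} + \dots + \frac{x^{2n+1}}{(2n+1)!} + o(x^{2n+2}). $$

Similarly,

$$ \cosh x = \frac{1}{2}(e^x + e^{-x}) = 1 + \frac{x^2}{2} + \frac{x^4}{4!} + \dots + \frac{x^{2n}}{(2n)!} + o(x^{2n+1}). $$

The analogies of these expansions to those of $\sin x$ and $\cos x$ should not go unnoticed.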


Note that when the expansions of $f$ and $g$ have the same monomial terms up to
the exponent $n$, these all cancel out in the difference $f - g$. In order to find the first
non-zero coefficient in the expansion of $f - g$ one has to look at expansions of $f$
and $g$ of order $n' > n$. In general it is not possible to predict what the minimum $n'$
will be, so one must proceed case by case. Using expansions 'longer' than necessary
entails superfluous computations, but is no mistake, in principle. On the contrary,
terminating an expansion 'too soon' leads to meaningless results or, in the worst
scenario, to a wrong conclusion.


Example 7.8
Determine the order at 0 of

$$ h(x) = e^x - \sqrt{1+2x} $$

by means of Maclaurin's expansion (see Sect. 7.4 in this respect).
Using first order expansions,

$$ f(x) = e^x = 1 + x + o(x), \qquad g(x) = \sqrt{1+2x} = 1 + x + o(x), $$

leads to the cancellation phenomenon just described. We may only say

$$ h(x) = o(x), $$

which is clearly not enough for the order of $h$. Instead, if we expand to second
order

$$ f(x) = e^x = 1 + x + \frac{x^2}{2} + o(x^2), \qquad g(x) = \sqrt{1+2x} = 1 + x - \frac{x^2}{2} + o(x^2), $$

then

$$ h(x) = x^2 + o(x^2) $$

shows $h(x)$ is infinitesimal of order two at the origin.
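A quick numerical experiment agrees with this conclusion. The sketch below (sample points chosen here for illustration) prints $h(x)/x$, which tends to 0, and $h(x)/x^2$, which tends to 1.

```python
import math

def h(x):
    return math.exp(x) - math.sqrt(1.0 + 2.0 * x)

for x in (1e-1, 1e-2, 1e-3):
    # h(x)/x -> 0 (first order says nothing), h(x)/x^2 -> 1 (order two)
    print(x, h(x) / x, h(x) / x ** 2)
```

Such a check only suggests the order; the expansion is what proves it, and for much smaller $x$ cancellation in floating point would spoil the quotients.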


Products
Using (5.5) d) and then (5.5) a) shows that

$$ f(x)g(x) = [p_n(x) + o(x^n)][q_n(x) + o(x^n)] = p_n(x)q_n(x) + p_n(x)o(x^n) + q_n(x)o(x^n) + o(x^n)o(x^n) = p_n(x)q_n(x) + o(x^n). $$

The product $p_n(x)q_n(x)$ contains powers of $x$ larger than $n$; each of them is an
$o(x^n)$, so we can eschew calculating it explicitly. We shall write

$$ p_n(x)q_n(x) = r_n(x) + o(x^n), $$

intending that $r_n(x)$ gathers all powers of order $\le n$, and nothing else, so in
conclusion

$$ f(x)g(x) = r_n(x) + o(x^n). $$


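Example 7.9
Expand to second order

$$ h(x) = \sqrt{1+x}\,e^x $$

at the origin. Since

$$ f(x) = \sqrt{1+x} = 1 + \frac{x}{2} - \frac{x^2}{8} + o(x^2), \qquad g(x) = e^x = 1 + x + \frac{x^2}{2} + o(x^2), $$

it follows

$$ \begin{aligned}
h(x) &= \Big(1 + \frac{x}{2} - \frac{x^2}{8}\Big)\Big(1 + x + \frac{x^2}{2}\Big) + o(x^2) \\
&= 1 + x + \frac{x^2}{2} + \frac{x}{2} + \frac{x^2}{2} + \boxed{\frac{x^3}{4}} - \frac{x^2}{8} - \boxed{\frac{x^3}{8}} - \boxed{\frac{x^4}{16}} + o(x^2) \\
&= 1 + \frac{3}{2}x + \frac{7}{8}x^2 + o(x^2).
\end{aligned} $$

The boxed terms have order larger than two, and therefore are already accounted
for by the symbol $o(x^2)$. Because of this, they need not have been computed
explicitly, although no harm was done.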
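The rule "multiply and discard every power above $n$" is mechanical, as the following sketch shows. Polynomials are represented here, by assumption, as coefficient lists ordered by increasing powers, and `mul_trunc` is a name introduced for the illustration.

```python
from fractions import Fraction as F

def mul_trunc(p, q, n):
    # product of two polynomials (coefficient lists, lowest degree first),
    # keeping only the powers of x up to degree n
    r = [F(0)] * (n + 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            if i + j <= n:
                r[i + j] += a * b
    return r

sqrt_part = [F(1), F(1, 2), F(-1, 8)]   # sqrt(1 + x) up to x^2
exp_part = [F(1), F(1), F(1, 2)]        # e^x up to x^2
print(mul_trunc(sqrt_part, exp_part, 2))
```

The test `i + j <= n` is exactly the step that skips the boxed terms of the example.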


Quotients
Suppose $g(0) \neq 0$ and let

$$ h(x) = \frac{f(x)}{g(x)} = r_n(x) + o(x^n), \quad \text{with } r_n(x) = \sum_{k=0}^{n} c_k x^k. $$

From $h(x)g(x) = f(x)$ we have

$$ r_n(x)q_n(x) + o(x^n) = p_n(x) + o(x^n). $$

This means that the part of degree $\le n$ in the polynomial $r_n(x)q_n(x)$ (degree $2n$)
must coincide with $p_n(x)$. By this observation we can determine the coefficients $c_k$
of $r_n(x)$ starting from $c_0$. The practical computation may be carried out like the
division algorithm for polynomials, so long as the latter are ordered with respect
to the increasing powers of $x$:

$$ \begin{array}{l|l}
a_0 + a_1 x + a_2 x^2 + \dots + a_n x^n + o(x^n) & b_0 + b_1 x + b_2 x^2 + \dots + b_n x^n + o(x^n) \\ \hline
a_0 + a_1' x + a_2' x^2 + \dots + a_n' x^n + o(x^n) & c_0 + c_1 x + \dots + c_n x^n + o(x^n) \\
\quad 0 + \tilde a_1 x + \tilde a_2 x^2 + \dots + \tilde a_n x^n + o(x^n) & \\
\quad \phantom{0 + {}} \tilde a_1 x + \tilde a_2' x^2 + \dots + \tilde a_n' x^n + o(x^n) & \\
\qquad \vdots &
\end{array} $$


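Examples 7.10

i) Let us compute the second order expansion of $h(x) = \frac{e^x}{3 + 2\log(1+x)}$. By
(7.9), (7.13), we have $e^x = 1 + x + \frac12 x^2 + o(x^2)$, and $3 + 2\log(1+x) = 3 + 2x - x^2 + o(x^2)$; dividing

$$ \begin{array}{l|l}
1 + x + \frac12 x^2 + o(x^2) & 3 + 2x - x^2 + o(x^2) \\ \hline
1 + \frac23 x - \frac13 x^2 + o(x^2) & \frac13 + \frac19 x + \frac{11}{54} x^2 + o(x^2) \\
\quad \frac13 x + \frac56 x^2 + o(x^2) & \\
\quad \frac13 x + \frac29 x^2 + o(x^2) & \\
\qquad \frac{11}{18} x^2 + o(x^2) & \\
\qquad \frac{11}{18} x^2 + o(x^2) & \\
\qquad\quad o(x^2) &
\end{array} $$

produces $h(x) = \frac13 + \frac19 x + \frac{11}{54} x^2 + o(x^2)$.

ii) Expand $h(x) = \tan x$ to the fourth order. The function being odd, it suffices
to find Maclaurin's polynomial of degree three, which is the same as the one of
order four. Since

$$ \sin x = x - \frac{x^3}{6} + o(x^3) \quad \text{and} \quad \cos x = 1 - \frac{x^2}{2} + o(x^3), $$

dividing

$$ \begin{array}{l|l}
x - \frac{x^3}{6} + o(x^3) & 1 - \frac{x^2}{2} + o(x^3) \\ \hline
x - \frac{x^3}{2} + o(x^3) & x + \frac{x^3}{3} + o(x^3) \\
\quad \frac{x^3}{3} + o(x^3) & \\
\quad \frac{x^3}{3} + o(x^3) & \\
\qquad o(x^3) &
\end{array} $$

yields

$$ \tan x = x + \frac{x^3}{3} + o(x^3) = x + \frac{x^3}{3} + o(x^4). $$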
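The division by increasing powers can be written as a short recursion on the coefficients: matching the coefficient of $x^k$ in $r_n(x)q_n(x) = p_n(x) + o(x^n)$ gives $c_k = \big(a_k - \sum_{j<k} c_j b_{k-j}\big)/b_0$. A sketch, with `div_series` a name introduced here and polynomials stored as coefficient lists (lowest degree first), applied to the two quotients of Examples 7.10:

```python
from fractions import Fraction as F

def div_series(p, q, n):
    # coefficients c_0..c_n of p/q, assuming q[0] != 0; p, q are coefficient
    # lists ordered by increasing powers (shorter lists are padded with zeros)
    p = list(p) + [F(0)] * (n + 1 - len(p))
    q = list(q) + [F(0)] * (n + 1 - len(q))
    c = []
    for k in range(n + 1):
        # match the coefficient of x^k in (c_0 + c_1 x + ...) * q with p[k]
        s = sum(c[j] * q[k - j] for j in range(k))
        c.append((p[k] - s) / q[0])
    return c

# e^x / (3 + 2 log(1 + x)) to order two
print(div_series([F(1), F(1), F(1, 2)], [F(3), F(2), F(-1)], 2))
# sin x / cos x to order three
print(div_series([F(0), F(1), F(0), F(-1, 6)], [F(1), F(0), F(-1, 2)], 3))
```

The requirement `q[0] != 0` is the hypothesis $g(0) \neq 0$ of the text.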


Composite maps
Let

$$ f(x) = a_1 x + a_2 x^2 + \dots + a_n x^n + o(x^n) $$

be the Maclaurin expansion of an infinitesimal function for $x \to 0$ (hence $a_0 = 0$).
Write

$$ g(y) = b_0 + b_1 y + \dots + b_n y^n + o(y^n) $$

for a second map $g(y)$. Recall
$o(y^n)$ stands for an infinitesimal of bigger order than $y^n$ as $y \to 0$,
which can be written

$$ o(y^n) = y^n o(1) \quad \text{with } o(1) \to 0 \text{ for } y \to 0. $$

Now consider the composition $h(x) = g(f(x))$ and substitute $y = f(x)$ in the
expansion of $g(y)$:

$$ g(f(x)) = b_0 + b_1 f(x) + b_2 [f(x)]^2 + \dots + b_n [f(x)]^n + [f(x)]^n o(1). $$

As $f(x)$ is continuous at 0, $y = f(x) \to 0$ for $x \to 0$, so $o(1) \to 0$ for $x \to 0$ as
well. Furthermore, expanding

$$ [f(x)]^n = a_1^n x^n + o(x^n) $$

yields

$$ [f(x)]^n o(1) = o(x^n) \quad \text{for } x \to 0. $$

The powers $[f(x)]^k$ ($1 \le k \le n$), expanded with respect to $x$ up to order $n$, provide
the expression of $g(f(x))$.


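Examples 7.11

i) Calculate to order two the expansion at 0 of

$$ h(x) = e^{\sqrt{1+x}-1}. $$

Define

$$ f(x) = \sqrt{1+x} - 1 = \frac{x}{2} - \frac{x^2}{8} + o(x^2), \qquad g(y) = e^y = 1 + y + \frac{y^2}{2} + o(y^2). $$

Then

$$ \begin{aligned}
h(x) &= 1 + \Big(\frac{x}{2} - \frac{x^2}{8} + o(x^2)\Big) + \frac{1}{2}\Big(\frac{x}{2} - \frac{x^2}{8} + o(x^2)\Big)^2 + o(x^2) \\
&= 1 + \frac{x}{2} - \frac{x^2}{8} + \frac{x^2}{8} + o(x^2) = 1 + \frac{x}{2} + o(x^2).
\end{aligned} $$

ii) Expand to order three in 0 the map

$$ h(x) = \frac{1}{1 + \log(1+x)}. $$

We can view this map as a quotient, but also as the composition of

$$ f(x) = \log(1+x) = x - \frac{x^2}{2} + \frac{x^3}{3} + o(x^3) $$

with

$$ g(y) = \frac{1}{1+y} = 1 - y + y^2 - y^3 + o(y^3). $$

It follows

$$ \begin{aligned}
h(x) &= 1 - \Big(x - \frac{x^2}{2} + \frac{x^3}{3} + o(x^3)\Big) + \Big(x - \frac{x^2}{2} + \frac{x^3}{3} + o(x^3)\Big)^2 - \Big(x - \frac{x^2}{2} + \frac{x^3}{3} + o(x^3)\Big)^3 + o(x^3) \\
&= 1 - x + \frac{x^2}{2} - \frac{x^3}{3} + x^2 - x^3 - x^3 + o(x^3) \\
&= 1 - x + \frac{3}{2}x^2 - \frac{7}{3}x^3 + o(x^3).
\end{aligned} $$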


Remark 7.12 If $f(x)$ is infinitesimal of order greater than one at the origin, we can spare ourselves some computations, in the sense that we might be able to infer the expansion of $h(x) = g(f(x))$ of degree $n$ from lower-order expansions of $g(y)$. For example, let $f$ be infinitesimal of order 2 at the origin ($a_1 = 0$, $a_2 \neq 0$). Because $[f(x)]^k = a_2^k x^{2k} + o(x^{2k})$, an expansion for $g(y)$ of order $\frac{n}{2}$ (if $n$ even) or $\frac{n+1}{2}$ ($n$ odd) is sufficient to determine $h(x)$ up to degree $n$. (Note that $f(x)$ should be expanded to order $n$, in general.) □




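Example 7.13

Expand to second order

$$h(x) = \sqrt{\cos x} = \sqrt{1 + (\cos x - 1)}.$$

Set

$$f(x) = \cos x - 1 = -\frac{x^2}{2} + o(x^2) \quad\text{(2nd order)},$$
$$g(y) = \sqrt{1+y} = 1 + \frac{y}{2} + o(y) \quad\text{(1st order)}.$$

Then

$$h(x) = 1 + \frac{1}{2}\left(-\frac{x^2}{2} + o(x^2)\right) + o(x^2) = 1 - \frac{x^2}{4} + o(x^2) \quad\text{(2nd order)}.$$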


Asymptotic expansions (not of Taylor type)
In many situations where $f(x)$ is infinite for $x \to 0$ (or $x \to x_0$) it is possible to find an 'asymptotic' expansion of $f(x)$ in increasing powers of $x$ (respectively $x - x_0$), by allowing negative powers in the expression:

$$f(x) = \frac{a_{-m}}{x^m} + \frac{a_{-m+1}}{x^{m-1}} + \dots + \frac{a_{-1}}{x} + a_0 + a_1 x + \dots + a_n x^n + o(x^n).$$

This form helps to understand better how $f$ tends to infinity. In fact, if $a_{-m} \neq 0$, $f$ will be infinite of order $m$ with respect to the test function $x^{-1}$.

One often arrives at such an expansion by means of the Taylor expansion of $\dfrac{1}{f(x)}$, which is infinitesimal for $x \to 0$. We explain the procedure with an example.


Example 7.14
Let us expand 'asymptotically', for $x \to 0$, the function

$$f(x) = \frac{1}{e^x - 1}.$$

The exponential expansion arrested at order three gives

$$e^x - 1 = x + \frac{x^2}{2} + \frac{x^3}{6} + o(x^3) = x\left(1 + \frac{x}{2} + \frac{x^2}{6} + o(x^2)\right),$$

so

$$f(x) = \frac{1}{x} \cdot \frac{1}{1 + \frac{x}{2} + \frac{x^2}{6} + o(x^2)}.$$

The latter ratio can be treated using Maclaurin's formula

$$\frac{1}{1+y} = 1 - y + y^2 + o(y^2);$$

by putting

$$y = \frac{x}{2} + \frac{x^2}{6} + o(x^2)$$

in fact, we obtain

$$f(x) = \frac{1}{x}\left(1 - \frac{x}{2} + \frac{x^2}{12} + o(x^2)\right) = \frac{1}{x} - \frac{1}{2} + \frac{x}{12} + o(x),$$

the asymptotic expansion of $f$ at the origin. Looking at such expression, we can deduce for instance that $f$ is infinite of order 1 with respect to $\varphi(x) = \frac{1}{x}$, as $x \to 0$. Ignoring the term $x/12$ and writing $f(x) = \frac{1}{x} - \frac{1}{2} + o(1)$ shows $f$ is asymptotic to the hyperbola

$$g(x) = \frac{2 - x}{2x}.$$
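The computation in Example 7.14 amounts to inverting the bracket $1 + \frac{x}{2} + \frac{x^2}{6} + \dots$ and then shifting all exponents down by one. A small sketch of this idea follows; the function names are hypothetical and chosen only for this illustration.

```python
from fractions import Fraction
from math import factorial


def reciprocal_series(b, order):
    """Coefficients of 1 / (b_0 + b_1 x + ...) up to x^order (b_0 != 0)."""
    b = [Fraction(c) for c in b] + [Fraction(0)] * (order + 1)
    c = [1 / b[0]]
    for k in range(1, order + 1):
        c.append(-sum(c[j] * b[k - j] for j in range(k)) / b[0])
    return c


def asymptotic_inverse_exp_minus_one(order):
    """Map exponent -> coefficient for 1/(e^x - 1), up to x^order."""
    # (e^x - 1)/x = 1 + x/2! + x^2/3! + ...
    bracket = [Fraction(1, factorial(k + 1)) for k in range(order + 2)]
    coeffs = reciprocal_series(bracket, order + 1)
    # Dividing by x lowers every exponent by one.
    return {k - 1: c for k, c in enumerate(coeffs)}


expansion = asymptotic_inverse_exp_minus_one(1)
```

The dictionary `expansion` associates the exponents $-1, 0, 1$ with $1, -\frac12, \frac1{12}$, matching $\frac1x - \frac12 + \frac{x}{12} + o(x)$.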


7.4 Local behaviour of a map via its Taylor expansion 


Taylor expansions at a given point are practical tools for studying how a function 
locally behaves around that point. We examine in the sequel a few interesting 
applications of Taylor expansions. 


Order and principal part of infinitesimal functions 
Let

$$f(x) = a_0 + a_1(x - x_0) + \dots + a_n(x - x_0)^n + o((x - x_0)^n)$$

be the Taylor expansion of order $n$ at a point $x_0$, and suppose there is an index $m$ with $1 \le m \le n$ such that

$$a_0 = a_1 = \dots = a_{m-1} = 0, \quad\text{but } a_m \neq 0.$$

In a sufficiently small neighbourhood of $x_0$,

$$f(x) = a_m(x - x_0)^m + o((x - x_0)^m)$$

will behave like the polynomial

$$p(x) = a_m(x - x_0)^m,$$

which is the principal part of $f$ with respect to the infinitesimal $y = x - x_0$. In particular, $f(x)$ has order $m$ with respect to that test function.


Example 7.15

Compute the order of the infinitesimal $f(x) = \sin x - x\cos x - \frac{1}{3}x^3$ with respect to $\varphi(x) = x$ as $x \to 0$. Expanding sine and cosine with Maclaurin we have

$$f(x) = -\frac{1}{30}x^5 + o(x^5), \qquad x \to 0.$$

Therefore $f$ is infinitesimal of order 5 and has principal part $p(x) = -\frac{1}{30}x^5$.
The same result follows from de l'Hôpital's Theorem, albeit differentiating five times is certainly more work than using well-known expansions.
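As a check of Example 7.15, the order and the principal part can be read off from the first non-zero Maclaurin coefficient. The sketch below is illustrative only; `principal_part` is a hypothetical helper, and the coefficient lists are built by hand from the standard expansions of sine and cosine.

```python
from fractions import Fraction
from math import factorial


def principal_part(coeffs):
    """Return (m, a_m) for the first non-zero coefficient, or None."""
    for m, a in enumerate(coeffs):
        if a != 0:
            return m, a
    return None


N = 7
sin_c = [Fraction((-1) ** (k // 2), factorial(k)) if k % 2 == 1 else Fraction(0)
         for k in range(N + 1)]
cos_c = [Fraction((-1) ** (k // 2), factorial(k)) if k % 2 == 0 else Fraction(0)
         for k in range(N + 1)]

x_cos = [Fraction(0)] + cos_c[:N]          # multiply cos x by x
f_c = [s - xc for s, xc in zip(sin_c, x_cos)]
f_c[3] -= Fraction(1, 3)                   # subtract x^3 / 3

order, coefficient = principal_part(f_c)
```

The result is `order == 5` and `coefficient == Fraction(-1, 30)`, i.e. $p(x) = -\frac{1}{30}x^5$.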




Local behaviour of a function
The knowledge of the Taylor expansion of $f$ to order two around a point $x_0$,

$$f(x) = a_0 + a_1(x - x_0) + a_2(x - x_0)^2 + o((x - x_0)^2), \qquad x \to x_0,$$

allows us to deduce from (7.7) that

$$f(x_0) = a_0, \quad f'(x_0) = a_1, \quad f''(x_0) = 2a_2.$$

Suppose $f$ is twice differentiable with continuity around $x_0$. By Theorem 4.2 the signs of $a_0$, $a_1$, $a_2$ (when $\neq 0$) coincide with the signs of $f(x)$, $f'(x)$, $f''(x)$, respectively, in a neighbourhood of $x_0$. This fact permits us, in particular, to detect local monotonicity and convexity, because of Theorem 6.27 b2) and Corollary 6.38 b2).

Example 7.6 (continuation)

Return to Example 7.6: we have $f(2) > 0$, $f'(2) < 0$ and $f''(2) > 0$. Around $x_0 = 2$ then, $f$ is strictly positive, strictly decreasing and strictly convex.

We deal with the cases $a_1 = 0$ or $a_2 = 0$ below.


Nature of critical points
Let $x_0$ be a critical point for $f$, which is assumed differentiable around $x_0$. By Corollary 6.28, different signs of $f'$ at the left and right of $x_0$ mean that the point is an extremum; if the sign stays the same instead, $x_0$ is an inflection point with horizontal tangent.

When $f$ possesses higher derivatives at $x_0$, as an alternative to the sign of $f'$ around $x_0$ we can understand what sort of critical point $x_0$ is by looking at the first non-zero derivative of $f$ evaluated at the point. In fact,

Theorem 7.16 Let $f$ be differentiable $n \ge 2$ times at $x_0$ and suppose

$$f'(x_0) = \dots = f^{(m-1)}(x_0) = 0, \qquad f^{(m)}(x_0) \neq 0 \qquad (7.21)$$

for some $2 \le m \le n$.

i) When $m$ is even, $x_0$ is an extremum, namely a maximum if $f^{(m)}(x_0) < 0$, a minimum if $f^{(m)}(x_0) > 0$.

ii) When $m$ is odd, $x_0$ is an inflection point with horizontal tangent; more precisely the inflection is descending if $f^{(m)}(x_0) < 0$, ascending if $f^{(m)}(x_0) > 0$.


Proof. Compare $f(x)$ and $f(x_0)$ around $x_0$. From (7.6)-(7.7) and the assumption (7.21), we have

$$f(x) - f(x_0) = \frac{f^{(m)}(x_0)}{m!}(x - x_0)^m + o((x - x_0)^m).$$

But $o((x - x_0)^m) = (x - x_0)^m o(1)$, so

$$f(x) - f(x_0) = (x - x_0)^m \left[\frac{f^{(m)}(x_0)}{m!} + h(x)\right]$$

for a suitable $h(x)$, infinitesimal when $x \to x_0$. Therefore, in a sufficiently small neighbourhood of $x_0$, the term in square brackets has the same sign as $f^{(m)}(x_0)$, hence the sign of $f(x) - f(x_0)$, in that same neighbourhood, is determined by $f^{(m)}(x_0)$ and $(x - x_0)^m$. Examining all sign possibilities proves the claim. □


Example 7.17

Assume that around $x_0 = 1$ we have

$$f(x) = 2 - 15(x - 1)^4 + 20(x - 1)^5 + o((x - 1)^5). \qquad (7.22)$$

From this we deduce

$$f'(1) = f''(1) = f'''(1) = 0, \quad\text{but } f^{(4)}(1) = -360 < 0.$$

Then $x_0$ is a relative maximum (Fig. 7.9, left).
Suppose now that in a neighbourhood of $x_1 = -2$ we can write

$$f(x) = 3 + 10(x + 2)^5 - 35(x + 2)^7 + o((x + 2)^7). \qquad (7.23)$$

Then

$$f'(-2) = f''(-2) = f'''(-2) = f^{(4)}(-2) = 0, \quad\text{and } f^{(5)}(-2) = 10 \cdot 5! > 0,$$

telling $x_1$ is an ascending inflection with horizontal tangent (Fig. 7.9, right). □
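Theorem 7.16 translates directly into a decision rule on the list of derivatives at the critical point. The following sketch is only an illustration of that rule; the function name and the string labels are assumptions, not notation from the text.

```python
def classify_critical_point(derivatives):
    """Classify x0 from [f'(x0), f''(x0), ..., f^(n)(x0)] following Theorem 7.16."""
    for m, value in enumerate(derivatives, start=1):
        if value != 0:
            break
    else:
        return "undetermined"  # all known derivatives vanish
    if m == 1:
        return "not a critical point"
    if m % 2 == 0:
        return "maximum" if value < 0 else "minimum"
    return "descending inflection" if value < 0 else "ascending inflection"


# Example 7.17: derivatives at x0 = 1 from (7.22), and at x1 = -2 from (7.23).
first = classify_critical_point([0, 0, 0, -360])
second = classify_critical_point([0, 0, 0, 0, 10 * 120])
```

With the data of Example 7.17 the two calls return `"maximum"` and `"ascending inflection"`.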


Figure 7.9. The map defined in (7.22), around $x_0 = 1$ (right), and the one defined in (7.23), around $x_1 = -2$ (left)


Points of inflection
Consider a twice differentiable $f$ around $x_0$. By Taylor's formulas we can decide whether $x_0$ is an inflection point for $f$.

First though, we need to prove Corollary 6.39 stated in Chap. 6, whose proof we had to postpone to the present section.

Proof. a) Let $x_0$ be an inflection point for $f$. Denoting as usual by $y = t(x) = f(x_0) + f'(x_0)(x - x_0)$ the tangent line to $f$ at $x_0$, Taylor's formula (7.6) ($n = 2$) gives

$$f(x) - t(x) = \frac{1}{2}f''(x_0)(x - x_0)^2 + o((x - x_0)^2),$$

which we can write

$$f(x) - t(x) = (x - x_0)^2\left[\frac{1}{2}f''(x_0) + h(x)\right]$$

for some infinitesimal $h$ at $x_0$. By contradiction, if $f''(x_0) \neq 0$, in an arbitrarily small neighbourhood of $x_0$ the right-hand side would have constant sign at the left and right of $x_0$; this cannot be by hypothesis, as $f$ is assumed to inflect at $x_0$.

b) In this case we use Taylor's formula (7.8) with $n = 2$. For any $x \neq x_0$, around $x_0$ there is a point $\bar{x}$, lying between $x_0$ and $x$, such that

$$f(x) - t(x) = \frac{1}{2}f''(\bar{x})(x - x_0)^2.$$

Analysing the sign of the right-hand side concludes the proof. □


Suppose, from now on, that $f''(x_0) = 0$ and $f$ admits derivatives higher than the second. Instead of considering the sign of $f''$ around $x_0$, we may study the point $x_0$ by means of the first non-zero derivative of order $> 2$ evaluated at $x_0$.

Theorem 7.18 Let $f$ be $n$ times differentiable ($n \ge 3$) at $x_0$, with

$$f''(x_0) = \dots = f^{(m-1)}(x_0) = 0, \qquad f^{(m)}(x_0) \neq 0 \qquad (7.24)$$

for some $m$ ($3 \le m \le n$).

i) When $m$ is odd, $x_0$ is an inflection point: descending if $f^{(m)}(x_0) < 0$, ascending if $f^{(m)}(x_0) > 0$.
ii) When $m$ is even, $x_0$ is not an inflection for $f$.


Proof. Just like in Theorem 7.16, we obtain

$$f(x) - t(x) = (x - x_0)^m\left[\frac{f^{(m)}(x_0)}{m!} + h(x)\right],$$

where $h(x)$ is a suitable infinitesimal function for $x \to x_0$. The claim follows from a sign argument concerning the right-hand side. □

Figure 7.10. Local behaviour of the map (7.25)


Example 7.19
Suppose that around $x_0 = 3$ we have

$$f(x) = -2 + 4(x - 3) - 90(x - 3)^5 + o((x - 3)^5). \qquad (7.25)$$

Then $f''(3) = f'''(3) = f^{(4)}(3) = 0$, $f^{(5)}(3) = -90 \cdot 5! < 0$. This implies that $x_0 = 3$ is a descending inflection for $f$ (Fig. 7.10).
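In the same spirit, Theorem 7.18 can be phrased as a test on the derivatives of order two and higher. The sketch below is an illustration under the assumption that the derivatives are supplied as a list starting from $f''(x_0)$; the function name is hypothetical.

```python
def classify_inflection(derivatives_from_second):
    """Apply Theorem 7.18 to [f''(x0), f'''(x0), ..., f^(n)(x0)]."""
    for m, value in enumerate(derivatives_from_second, start=2):
        if value != 0:
            break
    else:
        return "undetermined"  # no non-zero derivative available
    if m % 2 == 0:
        # Covers m = 2 (Corollary 6.39 a) and even m >= 4 (Theorem 7.18 ii).
        return "no inflection"
    return "descending inflection" if value < 0 else "ascending inflection"


# Example 7.19: f''(3) = f'''(3) = f^(4)(3) = 0, f^(5)(3) = -90 * 5!.
result = classify_inflection([0, 0, 0, -90 * 120])
```

For the data of Example 7.19 the function returns `"descending inflection"`.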


7.5 Exercises 


1. Use the definition to write the Taylor polynomial, of order $n$ and centred at $x_0$, for:

d) $f(x) = \sqrt{2x+1}$, $n = 3$, $x_0 = 4$

f) $f(x) = 2 - 8x^2 + 4x^3 + 9x^4$, $n = 3$, $x_0 = 0$


2. Determine the Taylor expansion of the indicated functions of the highest possible order; the expansion should be centred around $x_0$ and have Peano's remainder:

a) $f(x) = x^2|x| + e^{2x}$, $x_0 = 0$

b) $f(x) = 2 + x + (x-1)\sqrt[3]{x^2 - 1}$, $x_0 = 1$




3. With the aid of the elementary functions, write the Maclaurin expansion of the indicated functions of the given order, with Peano's remainder:

a) $f(x) = x\cos 3x - 3\sin x$, $n = 2$
b) $f(x) = \log\dfrac{1+x}{1+3x}$, $n = 4$
c) $f(x) = e^{x^2}\sin 2x$, $n = 5$
d) $f(x) = e^{-x\cos x} + \sin x - \cos x$, $n = 2$
e) $f(x) = \sqrt[3]{\cos(3x - x^2)}$, $n = 4$
L F 
A acer n=5 
g) $f(x) = \cosh^2 x - \sqrt{1 + 2x^2}$, $n = 4$
e7? —] 
h £) = ——_, t= 3 
) F(@) V cos 2% 
i) $f(x) = \dfrac{1}{-\sqrt{8}\sin x - 2\cos x}$, $n = 3$
ℓ) $f(x) = \sqrt[3]{8 + \sin 24x^2} - 2(1 + x^2\cos x^2)$, $n = 4$


4. Ascertain order and find principal part, for $x \to 0$, with respect to $\varphi(x) = x$ of the indicated functions:


a) f(e a ies ee 


; a cosh 27 
— sin _ 9 Lee 
‘) | f(@) SS d) f(x) =2r+4+ (a a : 
4 
510 = - metan = V1 1- 52? +sin 
1 — 42? 18 
5. Calculate order and principal part, when $x \to +\infty$, with respect to $\varphi(x) = \frac{1}{x}$ of the indicated functions:


1 
a) f@) =s5- 2(2 — 2) — log(x — 1) 
f(a) =e HF 1 


b) 
f(a) = V1 +30? + 03 — /2 450i pad 


dy f@\= {/2-+ sinh 5 _ 


6. Compute the limits: 


lim (1 + 28)1/(#" sin® 32) b) lim 


x20 x2 (4 = 7 ee 




1 1 1 1/24 
lim — | ————~ — — ] d) lim (c*" + sin? x — sinh? x) 
x30 \sin(tanz) 2 «0 
on ie 182+ ie 3x“[log(1 + sinh? «)| cosh? x 
«+0 cos 6a — 1+ 6x? e30  1—V/1+23 cos V23 


7. As $a$ varies in $\mathbb{R}$, determine the order of the infinitesimal map

$$h(x) = \log\cos x + \log\cosh(ax)$$

as $x \to 0$.

8. Compute the sixth derivative of

$$h(x) = \frac{\sinh(x^2 + 2\sin^4 x)}{1 + x^{10}}$$

evaluated at $x = 0$.

9. Let

$$\varphi(x) = \log(1 + 4x) - \sinh 4x + 8x^2.$$

Determine the sign of $y = \sin\varphi(x)$ on a left and on a right neighbourhood of $x_0 = 0$.

10. Prove that there exists a neighbourhood of 0 where

$$2\cos(x + x^2) \le 2 - x^2 - 2x^3.$$

11. Compute the limit

$$\lim_{x \to 0^+} \frac{e^{x/2} - \cosh\sqrt{x}}{(x + \sqrt[5]{x})^{\alpha}}$$

for all values $\alpha \in \mathbb{R}_+$.

12. Determine $\alpha \in \mathbb{R}$ so that

$$f(x) = (\arctan 2x)^2 - \alpha x\sin x$$

is infinitesimal of the fourth order as $x \to 0$.


7.5.1 Solutions 


1. Taylor's polynomials:

a) All derivatives of $f(x) = e^x$ are identical with the function itself, so $f^{(k)}(2) = e^2$, $\forall k \ge 0$. Therefore

$$Tf_{4,2}(x) = e^2 + e^2(x - 2) + \frac{e^2}{2}(x - 2)^2 + \frac{e^2}{6}(x - 2)^3 + \frac{e^2}{24}(x - 2)^4.$$




b) Tfo.g(2) =1- 5-5) + ale - 5) - S@— 2) 
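c) From $f'(x) = \frac{1}{x}$, $f''(x) = -\frac{1}{x^2}$, $f'''(x) = \frac{2}{x^3}$ it follows $f(3) = \log 3$, $f'(3) = \frac{1}{3}$, $f''(3) = -\frac{1}{9}$, $f'''(3) = \frac{2}{27}$. Then

$$Tf_{3,3}(x) = \log 3 + \frac{1}{3}(x - 3) - \frac{1}{18}(x - 3)^2 + \frac{1}{81}(x - 3)^3.$$

d) $Tf_{3,4}(x) = 3 + \frac{1}{3}(x - 4) - \frac{1}{54}(x - 4)^2 + \frac{1}{486}(x - 4)^3$.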


e) As $f'(x) = 1 - 6x + 15x^2$, $f''(x) = -6 + 30x$, we have $f(1) = 10$, $f'(1) = 10$, $f''(1) = 24$ and

$$Tf_{2,1}(x) = 10 + 10(x - 1) + 12(x - 1)^2.$$

Alternatively, we may substitute $t = x - 1$, i.e. $x = 1 + t$. The polynomial $f(x)$, written in the variable $t$, reads

$$g(t) = f(1 + t) = 7 + (1 + t) - 3(1 + t)^2 + 5(1 + t)^3 = 10 + 10t + 12t^2 + 5t^3.$$

Therefore the Taylor polynomial of $f(x)$ centred at $x_0 = 1$ corresponds to the Maclaurin polynomial of $g(t)$, whence immediately

$$Tg_{2,0}(t) = 10 + 10t + 12t^2.$$

Returning to the variable $x$, we find the same result.
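The change of variable $x = x_0 + t$ used in this solution is easy to automate for polynomials by means of the binomial formula. The sketch below is an illustration; `recentre` is a hypothetical name, and polynomials are stored as coefficient lists in increasing powers.

```python
from math import comb


def recentre(coeffs, x0):
    """Coefficients of g(t) = f(x0 + t) for the polynomial f with the given coefficients."""
    out = [0] * len(coeffs)
    for k, a in enumerate(coeffs):
        # Expand a * (x0 + t)^k with the binomial formula.
        for j in range(k + 1):
            out[j] += a * comb(k, j) * x0 ** (k - j)
    return out


# f(x) = 7 + x - 3x^2 + 5x^3 written in powers of t = x - 1.
g = recentre([7, 1, -3, 5], 1)
taylor_order_two = g[:3]
```

Here `g` equals `[10, 10, 12, 5]`, and truncating after the quadratic term gives the coefficients of the Taylor polynomial of order two centred at $x_0 = 1$.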
f) $Tf_{3,0}(x) = 2 - 8x^2 + 4x^3$.


2. Taylor’s expansions: 


a) We can write $f(x) = g(x) + h(x)$ using $g(x) = x^2|x|$ and $h(x) = e^{2x}$. The summand $h(x)$ is differentiable on $\mathbb{R}$ ad libitum, whereas $g(x)$ is continuous on $\mathbb{R}$ but arbitrarily differentiable only for $x \neq 0$. Additionally

$$g'(x) = \begin{cases} 3x^2 & \text{if } x > 0, \\ -3x^2 & \text{if } x < 0, \end{cases} \qquad g''(x) = \begin{cases} 6x & \text{if } x > 0, \\ -6x & \text{if } x < 0, \end{cases}$$

so

$$\lim_{x \to 0^+} g'(x) = \lim_{x \to 0^-} g'(x) = 0, \qquad \lim_{x \to 0^+} g''(x) = \lim_{x \to 0^-} g''(x) = 0.$$

By Theorem 6.15 we infer $g$ is differentiable twice at the origin, with vanishing derivatives. Since $g''(x) = 6|x|$ is not differentiable at $x = 0$, $g$ is not differentiable three times at 0, which makes $f$ expandable only up to order 2. From $h'(x) = 2e^{2x}$ and $h''(x) = 4e^{2x}$, we have $f(0) = 1$, $f'(0) = 2$, $f''(0) = 4$, so Maclaurin's formula reads:

$$f(x) = 1 + 2x + 2x^2 + o(x^2).$$


b) The map is differentiable only once at $x = 1$, and the expansion is $f(x) = 3 + (x - 1) + o(x - 1)$.


3. Maclaurin’s expansions: 


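a) $f(x) = -2x + o(x^2)$.
b) Writing $f(x) = \log(1 + x) - \log(1 + 3x)$, we can use the expansion of $\log(1 + t)$ with $t = x$ and $t = 3x$:

$$f(x) = x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} - 3x + \frac{(3x)^2}{2} - \frac{(3x)^3}{3} + \frac{(3x)^4}{4} + o(x^4) = -2x + 4x^2 - \frac{26}{3}x^3 + 20x^4 + o(x^4).$$

c) Combining the expansions of $e^t$ with $t = x^2$, and of $\sin t$ with $t = 2x$:

$$f(x) = \left(1 + x^2 + \frac{x^4}{2} + o(x^5)\right)\left(2x - \frac{(2x)^3}{3!} + \frac{(2x)^5}{5!} + o(x^6)\right)$$

$$= 2x - \frac{4}{3}x^3 + \frac{4}{15}x^5 + 2x^3 - \frac{4}{3}x^5 + x^5 + o(x^5) = 2x + \frac{2}{3}x^3 - \frac{1}{15}x^5 + o(x^5).$$

d) $f(x) = x^2 + o(x^2)$.
e) $f(x) = 1 - \frac{3}{2}x^2 + x^3 - \frac{31}{24}x^4 + o(x^4)$.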
f) This is solved by eeecae cs (1+ t)° and changing a = —4 and t = 2?: 


- 1 = 
Wate = (1 + 2”) ue = 2 (1 = ae + ( oe + ofa") 
x 


re 1 4 
fla) =a- Ee + oe at Ee — ae + o(x°) = Fa" + 0o(x”). 


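g) Referring to the expansions of $\cosh x$ and $(1 + t)^{\alpha}$, with $\alpha = \frac{1}{2}$, $t = 2x^2$:

$$f(x) = \left(1 + \frac{1}{2}x^2 + \frac{1}{4!}x^4 + o(x^4)\right)^2 - (1 + 2x^2)^{1/2}$$

$$= 1 + x^2 + \frac{1}{4}x^4 + \frac{2}{4!}x^4 + o(x^4) - \left(1 + \frac{1}{2}\,2x^2 + \binom{1/2}{2}(2x^2)^2 + o(x^4)\right)$$

$$= 1 + x^2 + \frac{1}{3}x^4 - 1 - x^2 + \frac{1}{2}x^4 + o(x^4) = \frac{5}{6}x^4 + o(x^4).$$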


h) f(a) = 2@ + 20? + Ba? + o(x3). 


i) Substitute for $\sin x$, $\cos x$ the respective Maclaurin expansions, to the effect that

$$f(x) = \frac{1}{-2 - \sqrt{8}\,x + x^2 + \frac{\sqrt{8}}{6}x^3 + o(x^3)}.$$

Expansion of the reciprocal eventually gives us

$$f(x) = -\frac{1}{2} + \frac{\sqrt{2}}{2}x - \frac{5}{4}x^2 + \frac{17}{12}\sqrt{2}\,x^3 + o(x^3).$$

ℓ) $f(x) = -2x^4 + o(x^4)$.


4. Order of infinitesimal and principal part for x — 0: 


a) The order is 2 and p(x) = —2e2? the principal part. 
b) Write

$$f(x) = \frac{\cos 2x + \log(1 + 4x^2) - \cosh 2x}{\cosh 2x}$$

and note that the order for $x \to 0$ can be deduced from the numerator only, for the denominator converges to 1. The expansions of $\cos t$, $\log(1 + t)$ and $\cosh t$ are known, so

$$\cos 2x + \log(1 + 4x^2) - \cosh 2x$$

$$= 1 - \frac{1}{2}(2x)^2 + \frac{1}{4!}(2x)^4 + (2x)^2 - \frac{1}{2}(2x)^4 - 1 - \frac{1}{2}(2x)^2 - \frac{1}{4!}(2x)^4 + o(x^4)$$

$$= -8x^4 + o(x^4).$$

Thus the order is 4, the principal part $p(x) = -8x^4$.


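c) Expanding $\sin t$ and $e^t$, then putting $t = \sqrt{x}$, we have

$$\varphi(t) = \frac{t^3 - \sin^3 t}{e^{3t} - 1} = \frac{t^3 - \left(t - \frac{1}{6}t^3 + o(t^3)\right)^3}{1 + 3t + o(t) - 1} = \frac{\frac{1}{2}t^5 + o(t^5)}{3t + o(t)} = \frac{1}{6}t^4 + o(t^4)$$

for $t \to 0$. Hence

$$f(x) = \frac{1}{6}x^2 + o(x^2),$$

implying that the order is 2 and $p(x) = \frac{1}{6}x^2$.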


d) The map has order 3 with principal part p(x) = ae 


e) Use the expansion of $(1 + t)^{\alpha}$ (where $\alpha = -\frac{1}{2}$) and $\arctan t$:

$$(1 - 4x^2)^{-1/2} = 1 + 2x^2 + o(x^3), \qquad \frac{x}{\sqrt{1 - 4x^2}} = x + 2x^3 + o(x^4),$$

$$\arctan\frac{x}{\sqrt{1 - 4x^2}} = x + 2x^3 + o(x^4) - \frac{1}{3}\left(x + 2x^3 + o(x^4)\right)^3 + o(x^3) = x + \frac{5}{3}x^3 + o(x^3).$$

In conclusion,

$$f(x) = -\frac{5}{3}x^3 + o(x^3),$$

so that the order is 3 and the principal part $p(x) = -\frac{5}{3}x^3$.
f) Order 6 and principal part p(x) = (-3 + sare. 


5. Order of infinitesimal and principal part as $x \to +\infty$:
a) When $x \to +\infty$ we write

$$f(x) = \frac{x - 2 - \log(x - 1)}{2(x - 2)^2 - (x - 2)\log(x - 1)} = \frac{x - 2 - \log(x - 1)}{2x^2 - 8x + 8 - (x - 2)\log(x - 1)} = \frac{x + o(x)}{2x^2 + o(x^2)} = \frac{1}{2x} + o\left(\frac{1}{x}\right),$$

from which one can recognise the order 1 and the principal part $p(x) = \frac{1}{2x}$.
b) The map is infinitesimal of order one, with principal part p(x) = —z. 


c) Write 


Using the expansion of (1+ t)® first with a = z, t= 3+4, then with a= , 
— 2 + =, we get 


SiR 


Therefore the order is 1, and p(x) = 


d) The order is 2 and p(x) = SES 


6. Limits:

a) Let us rewrite as

$$\lim_{x \to 0}(1 + x^6)^{1/(x^4\sin^2 3x)} = \lim_{x \to 0}\exp\left(\frac{\log(1 + x^6)}{x^4\sin^2 3x}\right) = \exp\left(\lim_{x \to 0}\frac{\log(1 + x^6)}{x^4\sin^2 3x}\right) = e^L.$$

To compute $L$, take the expansions of $\log(1 + t)$ and $\sin t$:

$$L = \lim_{x \to 0}\frac{x^6 + o(x^6)}{x^4(3x + o(x^2))^2} = \lim_{x \to 0}\frac{x^6 + o(x^6)}{9x^6 + o(x^6)} = \frac{1}{9}.$$

The required limit is $e^{1/9}$.


b) s2g7. 


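c) Expanding the sine and tangent,

$$L = \lim_{x \to 0}\frac{x - \sin(\tan x)}{x^2\sin(\tan x)} = \lim_{x \to 0}\frac{x - \tan x + \frac{1}{6}\tan^3 x + o(x^3)}{x^2(\tan x + o(x))}$$

$$= \lim_{x \to 0}\frac{x - x - \frac{1}{3}x^3 + \frac{1}{6}x^3 + o(x^3)}{x^3 + o(x^3)} = \lim_{x \to 0}\frac{-\frac{1}{6}x^3 + o(x^3)}{x^3 + o(x^3)} = -\frac{1}{6}.$$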
dye 2/5: e) -1. 


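f) Observe that

$$3x^4[\log(1 + \sinh^2 x)]\cosh^2 x \sim 3x^4\sinh^2 x \sim 3x^6$$

for $x \to 0$. Moreover, the denominator can be written as

$$\text{Den}: \; 1 - (1 + x^3)^{1/2}\cos x^{3/2} = 1 - \left(1 + \frac{1}{2}x^3 - \frac{1}{8}x^6 + o(x^6)\right)\left(1 - \frac{1}{2}x^3 + \frac{1}{24}x^6 + o(x^6)\right)$$

$$= 1 - \left(1 + \frac{1}{2}x^3 - \frac{1}{8}x^6 - \frac{1}{2}x^3 - \frac{1}{4}x^6 + \frac{1}{24}x^6 + o(x^6)\right) = \frac{1}{3}x^6 + o(x^6).$$

The limit is thus $9$.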


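7. Expand $\log(1 + t)$, $\cos t$, $\cosh t$, so that

$$h(x) = \log\left(1 - \frac{1}{2}x^2 + \frac{1}{4!}x^4 + o(x^5)\right) + \log\left(1 + \frac{1}{2}(ax)^2 + \frac{1}{4!}(ax)^4 + o(x^5)\right)$$

$$= -\frac{1}{2}x^2 + \frac{1}{4!}x^4 - \frac{1}{2}\left(-\frac{1}{2}x^2\right)^2 + o(x^5) + \frac{1}{2}(ax)^2 + \frac{1}{4!}(ax)^4 - \frac{1}{2}\left(\frac{1}{2}(ax)^2\right)^2 + o(x^5)$$

$$= \frac{1}{2}(a^2 - 1)x^2 + \left(\frac{1}{4!} - \frac{1}{8}\right)(a^4 + 1)x^4 + o(x^5).$$

If $a \neq \pm 1$, $h(x)$ is infinitesimal of order 2 for $x \to 0$. If $a = \pm 1$ the first non-zero coefficient multiplies $x^4$, making $h$ infinitesimal of order 4 for $x \to 0$.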


8. In order to compute $h^{(6)}(x)$ at $x = 0$ we use the fact that the Maclaurin coefficient of $x^6$ is $a_6 = \dfrac{h^{(6)}(0)}{6!}$. Therefore we need the expansion up to order six. Working on $\sin t$ and $\sinh t$, the numerator of $h$ becomes

$$\text{Num}: \; \sinh\left(x^2 + 2\left(x - \frac{x^3}{6} + o(x^3)\right)^4\right) = \sinh\left(x^2 + 2x^4 - \frac{4}{3}x^6 + o(x^6)\right)$$

$$= x^2 + 2x^4 - \frac{4}{3}x^6 + \frac{1}{6}x^6 + o(x^6) = x^2 + 2x^4 - \frac{7}{6}x^6 + o(x^6).$$

Dividing $x^2 + 2x^4 - \frac{7}{6}x^6 + o(x^6)$ by $1 + x^{10}$ one finds

$$h(x) = x^2 + 2x^4 - \frac{7}{6}x^6 + o(x^6),$$

so $h^{(6)}(0) = -\frac{7}{6} \cdot 6! = -840$.
9. Use the expansions of $\log(1 + t)$ and $\sinh t$ to write

$$\varphi(x) = 4x - \frac{1}{2}(4x)^2 + \frac{1}{3}(4x)^3 - 4x - \frac{1}{6}(4x)^3 + 8x^2 + o(x^3) = \frac{32}{3}x^3 + o(x^3).$$

Since the sine has the same sign as its argument around the origin, the function $y = \sin\varphi(x)$ is negative for $x < 0$ and positive for $x > 0$.


10. Using $\cos t$ in Maclaurin's form,

$$2\cos(x + x^2) = 2\left(1 - \frac{1}{2}(x + x^2)^2 + \frac{1}{4!}(x + x^2)^4 + o((x + x^2)^4)\right)$$

$$= 2 - (x^2 + 2x^3 + x^4) + \frac{1}{12}x^4 + o(x^4) = 2 - x^2 - 2x^3 - \frac{11}{12}x^4 + o(x^4)$$

on some neighbourhood $I$ of the origin. Then the given inequality holds on $I$, because the principal part of the difference between left- and right-hand side, clearly negative, equals $-\frac{11}{12}x^4$.


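11. Expand numerator and denominator separately as

$$\text{Num}: \; 1 + \frac{1}{2}x + \frac{1}{2}\left(\frac{x}{2}\right)^2 + o(x^2) - \left(1 + \frac{1}{2}x + \frac{1}{4!}x^2 + o(x^2)\right) = \frac{1}{12}x^2 + o(x^2),$$

$$\text{Den}: \; x^{\alpha/5}\left(1 + x^{4/5}\right)^{\alpha} = x^{\alpha/5}\left(1 + \alpha x^{4/5} + o(x^{4/5})\right).$$

Then

$$\lim_{x \to 0^+}\frac{e^{x/2} - \cosh\sqrt{x}}{(x + \sqrt[5]{x})^{\alpha}} = \lim_{x \to 0^+}\frac{\frac{1}{12}x^2 + o(x^2)}{x^{\alpha/5}\left(1 + \alpha x^{4/5} + o(x^{4/5})\right)} = \begin{cases} \frac{1}{12} & \text{if } \alpha = 10, \\ 0 & \text{if } \alpha < 10, \\ +\infty & \text{if } \alpha > 10. \end{cases}$$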


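12. Writing $\arctan t$ and $\sin t$ in Maclaurin's form provides

$$f(x) = \left(2x - \frac{1}{3}(2x)^3 + o(x^4)\right)^2 - \alpha x\left(x - \frac{1}{6}x^3 + o(x^3)\right)$$

$$= 4x^2 - \frac{32}{3}x^4 + o(x^4) - \alpha x^2 + \frac{\alpha}{6}x^4 + o(x^4) = (4 - \alpha)x^2 - \left(\frac{32}{3} - \frac{\alpha}{6}\right)x^4 + o(x^4).$$

This proves $f(x)$ infinitesimal of the fourth order at the origin if $\alpha = 4$. For such value in fact,

$$f(x) = -10x^4 + o(x^4).$$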


8 


Geometry in the plane and in space 


The chapter has two main goals. The first is to discuss the possibilities of representing objects in the plane and in three-dimensional space; in this sense we can think of this as an ideal continuation of Chap. 1. We shall introduce coordinate systems other than the Cartesian system, plus vectors and their elementary properties, and then the set $\mathbb{C}$ of complex numbers. Secondly, it is a good occasion for introducing concepts that will be dealt with in more depth during other lecture courses, for instance functions of several variables, or the theory of curves in space.


8.1 Polar, cylindrical, and spherical coordinates 


A point $P$ in the Cartesian plane can be described, apart from using the known coordinates $(x, y)$, by polar coordinates $(r, \theta)$, which are defined as follows. Denote by $r$ the distance of $P$ from the origin $O$. If $r > 0$ we let $\theta$ be the angle, measured in radians up to multiples of $2\pi$, between the positive $x$-axis and the half-line emanating from $O$ and passing through $P$, as in Fig. 8.1. It is common to choose $\theta$ in $(-\pi, \pi]$, or in $[0, 2\pi)$. When $r = 0$, $P$ coincides with the origin, and $\theta$ may be any number.

Figure 8.1. Polar and Cartesian coordinates in the plane

C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed.,
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_8,
© Springer International Publishing Switzerland 2015
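The passage from polar coordinates $(r, \theta)$ to Cartesian coordinates $(x, y)$ is given by

$$x = r\cos\theta, \qquad y = r\sin\theta. \qquad (8.1)$$

The inverse transformation, provided $\theta$ is chosen in the interval $(-\pi, \pi]$, is

$$r = \sqrt{x^2 + y^2}, \qquad \theta = \begin{cases} \arctan\dfrac{y}{x} & \text{if } x > 0, \\[4pt] \arctan\dfrac{y}{x} + \pi & \text{if } x < 0,\ y \ge 0, \\[4pt] \arctan\dfrac{y}{x} - \pi & \text{if } x < 0,\ y < 0, \\[4pt] \dfrac{\pi}{2} & \text{if } x = 0,\ y > 0, \\[4pt] -\dfrac{\pi}{2} & \text{if } x = 0,\ y < 0. \end{cases} \qquad (8.2)$$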


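Examples 8.1

i) Let $P$ have Cartesian coordinates $(x, y) = (6\sqrt{2}, 2\sqrt{6})$. Its distance from the origin is

$$r = \sqrt{72 + 24} = \sqrt{96} = 4\sqrt{6}.$$

As $x > 0$,

$$\theta = \arctan\frac{2\sqrt{6}}{6\sqrt{2}} = \arctan\frac{\sqrt{3}}{3} = \frac{\pi}{6}.$$

The polar coordinates of $P$ are then $(r, \theta) = \left(4\sqrt{6}, \frac{\pi}{6}\right)$.

ii) Let now $P$ have Cartesian coordinates $(x, y) = (-5, -5)$. Then $r = 5\sqrt{2}$, and since $x, y < 0$,

$$\theta = \arctan\frac{-5}{-5} - \pi = \arctan 1 - \pi = \frac{\pi}{4} - \pi = -\frac{3}{4}\pi,$$

whence $(r, \theta) = \left(5\sqrt{2}, -\frac{3}{4}\pi\right)$.

iii) Take $P$ of polar coordinates $(r, \theta) = \left(4, \frac{2}{3}\pi\right)$ this time; in the Cartesian system

$$x = 4\cos\frac{2}{3}\pi = 4\cos\left(\pi - \frac{\pi}{3}\right) = -4\cos\frac{\pi}{3} = -2,$$
$$y = 4\sin\frac{2}{3}\pi = 4\sin\left(\pi - \frac{\pi}{3}\right) = 4\sin\frac{\pi}{3} = 2\sqrt{3}. \qquad □$$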
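The conversions (8.1) and (8.2) can be tried numerically. The sketch below is an illustration and not part of the text: it relies on Python's standard `math.atan2`, which returns an angle in $[-\pi, \pi]$ and reproduces the case distinction of (8.2) for points other than the origin; the choice $\theta = 0$ at the origin is an arbitrary convention made here.

```python
import math


def to_polar(x, y):
    """Polar coordinates (r, theta) with theta in (-pi, pi]."""
    r = math.hypot(x, y)
    # At the origin theta is arbitrary; 0.0 is chosen by convention.
    theta = math.atan2(y, x) if r > 0 else 0.0
    return r, theta


def to_cartesian(r, theta):
    """Cartesian coordinates from polar ones, formula (8.1)."""
    return r * math.cos(theta), r * math.sin(theta)


# The three points of Examples 8.1.
p1 = to_polar(6 * math.sqrt(2), 2 * math.sqrt(6))
p2 = to_polar(-5, -5)
p3 = to_cartesian(4, 2 * math.pi / 3)
```

Up to rounding, `p1` is $(4\sqrt6, \pi/6)$, `p2` is $(5\sqrt2, -3\pi/4)$ and `p3` is $(-2, 2\sqrt3)$.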


Moving on to the representation of a point $P \in \mathbb{R}^3$ of coordinates $(x, y, z)$, we shall introduce two new frame systems: cylindrical coordinates and spherical coordinates.

The cylindrical system is simply given by replacing the coordinates $(x, y)$ of the point $P'$, orthogonal projection of $P$ on the $xy$-plane, by its polar ones $(r', \theta)$, and maintaining $z$ as it is. Denoting by $(r', \theta, t)$ the cylindrical coordinates of $P$, we have

$$x = r'\cos\theta, \qquad y = r'\sin\theta, \qquad z = t.$$

In this case too the angle $\theta$ is defined up to multiples of $2\pi$; if we confine $\theta$ to the interval $(-\pi, \pi]$, as above, cylindrical coordinates are functions of the Cartesian ones by defining $r'$ and $\theta$ with (8.2) (Fig. 8.2, left).

Spherical coordinates $(r, \varphi, \theta)$ are defined as follows. Let $r = \sqrt{x^2 + y^2 + z^2}$ be the distance of $P$ from the origin, $\varphi$ the angle between the positive $z$-axis and the ray from $O$ through $P$, $\theta$ the angle between the positive $x$-axis and the line in the $xy$-plane passing through the origin and the projection $P'$ of $P$ on the same plane. This is probably better understood by looking at Fig. 8.2, right. Borrowing terms from geography, one calls $\theta$ the longitude and $\varphi$ the colatitude of $P$ (whereas $\frac{\pi}{2} - \varphi$ is the latitude, in radians).

Therefore $z = r\cos\varphi$, while the expressions $x = r'\cos\theta$ and $y = r'\sin\theta$ derive from noting that $r'$ is the distance of $P'$ from $O$, $r' = r\sin\varphi$. Then the Cartesian coordinates of $P$ are, in terms of the spherical triple $(r, \varphi, \theta)$,

$$x = r\sin\varphi\cos\theta, \qquad y = r\sin\varphi\sin\theta, \qquad z = r\cos\varphi.$$

The inverse transformation is easily found by dimensional reduction. We just remark that it is enough to vary $\varphi$ in an interval of width $\pi$, e.g. $[0, \pi]$. Instead, $\theta$ has freedom $2\pi$, for instance $\theta \in (-\pi, \pi]$, as in the 2-dimensional case.

Figure 8.2. Cylindrical coordinates (left) and spherical coordinates (right)


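Example 8.2
Consider the point $P$ of Cartesian coordinates $(1, 1, \sqrt{6})$. The point $P' = (1, 1, 0)$ is the orthogonal projection of $P$ onto the $xy$-plane, so its polar coordinates are $(r', \theta) = \left(\sqrt{2}, \frac{\pi}{4}\right)$ in that plane. The cylindrical coordinates of $P$ are therefore

$$(r', \theta, t) = \left(\sqrt{2}, \frac{\pi}{4}, \sqrt{6}\right).$$

Now to spherical coordinates. First, $r = \sqrt{1 + 1 + 6} = 2\sqrt{2}$; moreover, $\sin\varphi = \frac{r'}{r} = \frac{\sqrt{2}}{2\sqrt{2}} = \frac{1}{2}$ implies $\varphi = \pi/6$, because $\varphi$ varies in $[0, \pi]$ and $z = r\cos\varphi > 0$. Therefore $P$ has coordinates

$$(r, \varphi, \theta) = \left(2\sqrt{2}, \frac{\pi}{6}, \frac{\pi}{4}\right). \qquad □$$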
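A numerical version of Example 8.2 is sketched below. The function names are illustrative assumptions; the colatitude is computed from $\cos\varphi = z/r$, which avoids the ambiguity of recovering $\varphi \in [0, \pi]$ from $\sin\varphi$ alone.

```python
import math


def to_cylindrical(x, y, z):
    """Cylindrical coordinates (r', theta, t) with theta in (-pi, pi]."""
    return math.hypot(x, y), math.atan2(y, x), z


def to_spherical(x, y, z):
    """Spherical coordinates (r, phi, theta): phi is the colatitude in [0, pi]."""
    r = math.sqrt(x * x + y * y + z * z)
    phi = math.acos(z / r) if r > 0 else 0.0  # convention at the origin
    theta = math.atan2(y, x)
    return r, phi, theta


def from_spherical(r, phi, theta):
    """Cartesian coordinates from the spherical triple."""
    return (r * math.sin(phi) * math.cos(theta),
            r * math.sin(phi) * math.sin(theta),
            r * math.cos(phi))


P = (1.0, 1.0, math.sqrt(6))
cyl = to_cylindrical(*P)
sph = to_spherical(*P)
back = from_spherical(*sph)
```

Up to rounding, `cyl` is $(\sqrt2, \pi/4, \sqrt6)$ and `sph` is $(2\sqrt2, \pi/6, \pi/4)$; `back` recovers the original point.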


8.2 Vectors in the plane and in space 


We discuss the basics of Vector Calculus, which focuses on vectors and how they 
add, multiply, and so on. We start with vectors whose initial point is the origin, 
and later generalise this situation to arbitrary initial points in the plane or in 
space. 


8.2.1 Position vectors 


Equip the plane with an orthogonal frame system. A pair $(x, y) \neq (0, 0)$ in $\mathbb{R}^2$ identifies a position vector (or just vector) $v$ in the plane, given by the line segment with initial point $O = (0, 0)$ and end point $P = (x, y)$, see Fig. 8.3, left. (The orientation from $O$ to $P$ is indicated by an arrow with point at $P$.)

The coordinates $x, y$ of $P$ are called the components of the vector $v$ (in the chosen frame system); one writes $v = (x, y)$, identifying the vector $v$ with its end point $P$.

Position vectors in space are defined in a similar fashion: a vector $v$ with components $(x, y, z) \neq (0, 0, 0)$ is drawn as the oriented segment going from $O = (0, 0, 0)$ to $P = (x, y, z)$ (Fig. 8.3, right), so one writes $v = (x, y, z)$.

Figure 8.3. A vector in the plane (left), and in space (right)

In space or in the plane, the vector $0$ with components all zero is called the zero vector; it is identified with the origin and has no arrow. In this way position vectors in the plane (or in space) are in bijective correspondence with points of $\mathbb{R}^2$ (resp. $\mathbb{R}^3$). Henceforth we shall not specify every time whether we are talking about planar or spatial vectors: the generic $v$, of components $(v_1, v_2)$ or $(v_1, v_2, v_3)$, will be described with $(v_1, \dots, v_d)$. The capital letter $V$ will be the set of vectors of the plane or of space, with no distinction.

Having fixed the origin point $O$, a vector is intrinsically determined (irrespective of the chosen Cartesian frame) by a direction, the straight line through the origin and containing the vector, an orientation, the direction given by the arrow, and a length or norm, the actual length of the segment $OP$. Rather often the notion of direction tacitly includes an orientation as well.


Let us define operations. Take vectors $v = (v_1, \dots, v_d)$ and $w = (w_1, \dots, w_d)$. The sum of $v$ and $w$ is the vector $v + w$ whose components are given by the sum of the corresponding (i.e., with the same subscript) components of the two original vectors

$$v + w = (v_1 + w_1, \dots, v_d + w_d). \qquad (8.3)$$

In Vector Calculus real numbers $\lambda \in \mathbb{R}$ are referred to as scalars. The product of the vector $v$ by (the scalar) $\lambda$ is the vector $\lambda v$, whose $j$th component is the product of the $j$th component of $v$ by $\lambda$

$$\lambda v = (\lambda v_1, \dots, \lambda v_d). \qquad (8.4)$$

The product $(-1)v$ is denoted $-v$ and called the opposite vector to $v$. The difference $v - w$ is defined as

$$v - w = v + (-w) = (v_1 - w_1, \dots, v_d - w_d). \qquad (8.5)$$

The operations just introduced enjoy the familiar properties of the sum and the product (associative, commutative, distributive, ...), due to their component-wise nature.


These operations also have a neat geometric interpretation. If $\lambda > 0$, the vector $\lambda v$ has the same direction (and orientation) as $v$, i.e., it lies on the same (oriented) straight line, and its length is $\lambda$ times the length of $v$ (see Fig. 8.4); if $\lambda < 0$, $\lambda v = -|\lambda|v = |\lambda|(-v)$ so the same argument applies to $-v$. Two vectors $v$ and $w$ are parallel, or collinear, if $w = \lambda v$ for a $\lambda \neq 0$.

Figure 8.4. The vectors $v$ and $\lambda v$

The sum of non-zero position vectors $v$ and $w$ should be understood as follows. When the two vectors are collinear, $w = \lambda v$, then $v + w = (1 + \lambda)v$, parallel to both of them. Otherwise, $v$ and $w$ lie on distinct straight lines, say $r_v$ and $r_w$, that meet at the origin. Let $\Pi$ be the plane determined by these lines (if $v$ and $w$ are vectors in the plane, clearly $\Pi$ is the plane); $v$ and $w$ determine a parallelogram on $\Pi$ (Fig. 8.5). Precisely, let $P, Q$ be the end points of $v$ and $w$. The parallelogram in question is then enclosed by the lines $r_v$, $r_w$, the parallel to $r_w$ through $P$ and the parallel to $r_v$ through $Q$; its vertices are $O$, $P$, $Q$ and $R$, the vertex 'opposite' the origin. The sum $v + w$ is then the diagonal $OR$, oriented from $O$ to $R$. The vertex $R$ can be reached by 'moving' along the sides: for instance, we can start at $P$ and draw a segment parallel to $OQ$, having the same length, and lying on the same side with respect to $r_v$.

Figure 8.6 represents the difference $v - w$: the position vector $v - w = v + (-w)$ is the diagonal of the parallelogram determined by vectors $v$, $-w$. Alternatively, we can take the diagonal $QP$ and displace it 'rigidly' to the origin, i.e., keeping it parallel to itself, finding $v - w$.

The set $V$ of vectors (in the plane or in space), equipped with the operations of sum and multiplication by a scalar, is an example of a vector space over $\mathbb{R}$. Any vector $v = \lambda v_1 + \mu v_2$, with $v_1, v_2 \in V$ and $\lambda, \mu \in \mathbb{R}$, is called a linear combination of the two vectors $v_1$ and $v_2$. This generalises to linear combinations of a finite number of vectors.


Figure 8.5. Sum of two vectors v + w 


Figure 8.6. Difference vector $v - w$

Examples 8.3
i) Given vectors $v_1 = (2, 5, -4)$ and $v_2 = (-1, 3, 0)$, the linear combination $v = 3v_1 - 5v_2$ is $v = (11, 0, -12)$.
ii) The vectors $v = (\sqrt{8}, -2, 2\sqrt{5})$ and $w = (2, -\sqrt{2}, \sqrt{10})$ are parallel, since the ratio of the corresponding components is always the same:

$$\frac{\sqrt{8}}{2} = \frac{-2}{-\sqrt{2}} = \frac{2\sqrt{5}}{\sqrt{10}} = \sqrt{2};$$

hence $v = \sqrt{2}\,w$.
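The component-wise operations (8.3) and (8.4) and the parallelism test of Examples 8.3 can be sketched as follows. Vectors are represented as plain tuples and the helper names are assumptions made for the illustration; parallelism is checked through the vanishing of all $2 \times 2$ cross products $v_i w_j - v_j w_i$, which avoids dividing by zero components.

```python
import math


def add(v, w):
    """Component-wise sum, formula (8.3)."""
    return tuple(a + b for a, b in zip(v, w))


def scale(lam, v):
    """Product of the vector v by the scalar lam, formula (8.4)."""
    return tuple(lam * a for a in v)


def are_parallel(v, w, tol=1e-9):
    """True if the non-zero vectors v, w satisfy w = lam * v for some lam != 0."""
    if all(a == 0 for a in v) or all(b == 0 for b in w):
        return False
    d = len(v)
    return all(abs(v[i] * w[j] - v[j] * w[i]) <= tol
               for i in range(d) for j in range(i + 1, d))


v1, v2 = (2, 5, -4), (-1, 3, 0)
combo = add(scale(3, v1), scale(-5, v2))

v = (math.sqrt(8), -2, 2 * math.sqrt(5))
w = (2, -math.sqrt(2), math.sqrt(10))
parallel = are_parallel(v, w)
```

Here `combo` is `(11, 0, -12)` and `parallel` is `True`, in agreement with Examples 8.3.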


8.2.2 Norm and scalar product 


The norm of a position vector $v$ with end point $P$ is defined, we recall, as the length of $OP$, i.e., the Euclidean distance of $P$ to the origin. It is denoted by the symbol $\|v\|$ and can be expressed in terms of the components of $v$ as

$$\|v\| = \sqrt{\sum_{i=1}^{d} v_i^2} = \begin{cases} \sqrt{v_1^2 + v_2^2} & \text{if } d = 2, \\[4pt] \sqrt{v_1^2 + v_2^2 + v_3^2} & \text{if } d = 3. \end{cases}$$

The norm of a vector is always non-negative, and moreover $\|v\| = 0$ if and only if $v = 0$. The following relations hold, proof of which will be given on p. 269:

$$\|\lambda v\| = |\lambda|\,\|v\|, \qquad \|v + w\| \le \|v\| + \|w\| \qquad (8.6)$$

for any $v, w \in V$ and any $\lambda \in \mathbb{R}$.
A vector of norm 1 is called a unit vector, and geometrically, it has end point $P$ lying on the unit circle or unit sphere centred at the origin. Each non-zero vector $v$ has a corresponding unit vector $\hat{v} = \dfrac{v}{\|v\|}$, parallel to $v$. Thus $v = \|v\|\,\hat{v}$, showing that any vector can be represented as the product of a unit vector by its own length.


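Let us introduce the operation known as scalar product, or dot product of two vectors. Given $v = (v_1, \dots, v_d)$ and $w = (w_1, \dots, w_d)$, their dot product is the real number

$$v \cdot w = \sum_{i=1}^{d} v_i w_i = \begin{cases} v_1 w_1 + v_2 w_2 & \text{if } d = 2, \\ v_1 w_1 + v_2 w_2 + v_3 w_3 & \text{if } d = 3. \end{cases}$$

Easy-to-verify properties are:

$$v \cdot w = w \cdot v, \qquad (8.7)$$
$$(\lambda v_1 + \mu v_2) \cdot w = \lambda(v_1 \cdot w) + \mu(v_2 \cdot w), \qquad (8.8)$$

for any $v, w, v_1, v_2 \in V$, $\lambda, \mu \in \mathbb{R}$.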
A vector’s norm may be defined from the scalar product, as 


|v = Vow (8.9) 


for any vu € V. Vice versa, for any v,w € V, one has 
1 
vw =5(v + wll” — lell? — lel’), (8.10) 


which allows to compute scalar products using norms (see p. 269 for the proof). 
Furthermore, a fundamental relation, known as the Cauchy-Schwarz inequality,
holds: for every v, w ∈ V

$$|v \cdot w| \le \|v\|\,\|w\|. \tag{8.11}$$

Even more precisely,

$$v \cdot w = \|v\|\,\|w\| \cos\theta \tag{8.12}$$

where θ is the angle formed by v and w (whether θ is the clockwise, anti-clockwise,
acute or obtuse angle is completely irrelevant, for cos θ = cos(−θ) = cos(2π − θ)).
Formulas (8.11) and (8.12) will also be proved later.

The dot product leads to the notion of orthogonality. Two vectors v, w are
said to be orthogonal (or perpendicular) if

$$v \cdot w = 0;$$

formula (8.12) tells us that two vectors are orthogonal when either one is the zero
vector, or the angle between them is a right angle. By (8.10), the orthogonality of
v and w is equivalent to

$$\|v + w\|^2 = \|v\|^2 + \|w\|^2,$$

well known to the reader under the name Pythagoras’s Theorem (Fig. 8.7).

Figure 8.7. Pythagoras’s Theorem

Given a vector v and a unit vector u, the component of v along u is the
vector

$$v_u = (v \cdot u)\,u,$$

while the component of v orthogonal to u is the complement

$$v_{u^\perp} = v - v_u.$$

Therefore the vector v splits as a sum

$$v = v_u + v_{u^\perp} \quad \text{with} \quad v_u \cdot v_{u^\perp} = 0, \tag{8.13}$$

a relation called orthogonal decomposition of v with respect to the unit vector
u (Fig. 8.8).


Figure 8.8. Orthogonal decomposition of v with respect to the unit vector u


Figure 8.9. The unit vectors i, j, k


Examples 8.4 
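i) v = (1, 0, √3) and w = (1, 2, √3) have norm
$$\|v\| = \sqrt{1 + 0 + 3} = 2, \qquad \|w\| = \sqrt{1 + 4 + 3} = 2\sqrt{2};$$
their scalar product is v · w = 1 + 0 + 3 = 4.

To compute the angle θ they form, we recover from (8.12)
$$\cos\theta = \frac{v \cdot w}{\|v\|\,\|w\|} = \frac{\sqrt{2}}{2},$$
so θ = π/4.

ii) The vectors v = (1, 2, −1), w = (−1, 1, 1) are orthogonal since
v · w = −1 + 2 − 1 = 0.

iii) Take the unit vector $u = \left(\frac{1}{\sqrt{3}}, \frac{1}{\sqrt{3}}, -\frac{1}{\sqrt{3}}\right)$. Given v = (3, 1, 1), we have
$$v \cdot u = \sqrt{3} + \frac{1}{\sqrt{3}} - \frac{1}{\sqrt{3}} = \sqrt{3},$$
so the component of v along u is
$$v_u = \sqrt{3}\left(\frac{1}{\sqrt{3}}, \frac{1}{\sqrt{3}}, -\frac{1}{\sqrt{3}}\right) = (1, 1, -1),$$
while the orthogonal component reads
$$v_{u^\perp} = (3, 1, 1) - (1, 1, -1) = (2, 0, 2).$$

That (8.13) holds is now easy to check. □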
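
The computations of Examples 8.4 can be replayed numerically. The following Python
sketch is only an illustration and not part of the text's development: the helper names
`dot`, `norm`, `angle` and `decompose` are hypothetical choices, vectors are assumed to be
plain tuples of floats, and the results agree with the exact values only up to rounding.

```python
import math

def dot(v, w):
    """Dot product of two vectors given as equal-length sequences."""
    return sum(vi * wi for vi, wi in zip(v, w))

def norm(v):
    """Euclidean norm, computed from the dot product as in (8.9)."""
    return math.sqrt(dot(v, v))

def angle(v, w):
    """Angle in [0, pi] between two non-zero vectors, obtained from (8.12)."""
    c = dot(v, w) / (norm(v) * norm(w))
    return math.acos(max(-1.0, min(1.0, c)))  # clamp to guard against rounding

def decompose(v, u):
    """Components of v along and orthogonal to the unit vector u, as in (8.13)."""
    s = dot(v, u)
    along = tuple(s * ui for ui in u)
    orth = tuple(vi - ai for vi, ai in zip(v, along))
    return along, orth

# Example 8.4 i): the angle should be close to pi/4
v, w = (1.0, 0.0, math.sqrt(3)), (1.0, 2.0, math.sqrt(3))
theta = angle(v, w)

# Example 8.4 iii): expected components close to (1, 1, -1) and (2, 0, 2)
u = (1 / math.sqrt(3), 1 / math.sqrt(3), -1 / math.sqrt(3))
along, orth = decompose((3.0, 1.0, 1.0), u)
```

With these helpers, `dot(along, orth)` vanishes up to rounding, in agreement with (8.13).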


We introduce the unit vectors i = (1, 0, 0), j = (0, 1, 0) and k = (0, 0, 1) of
space, which are parallel to the axes of the Cartesian frame (Fig. 8.9); at times
these unit vectors are denoted $e_1, e_2, e_3$. They are clearly pairwise orthogonal

$$i \cdot j = j \cdot k = i \cdot k = 0. \tag{8.14}$$


They form a so-called orthonormal frame for V (by definition, a set of pairwise 
orthogonal unit vectors). 


Let $v = (v_1, v_2, v_3)$ be arbitrary. Since

$$v = (v_1, 0, 0) + (0, v_2, 0) + (0, 0, v_3) = v_1 (1, 0, 0) + v_2 (0, 1, 0) + v_3 (0, 0, 1)$$

we write

$$v = v_1 i + v_2 j + v_3 k.$$

This explains that any vector in space can be represented as a linear combination
of the unit vectors i, j, k, whence the latter triple forms an orthonormal basis of
V. The dot product of v with the orthonormal vectors i, j, k yields the components
of v

$$v_1 = v \cdot i, \qquad v_2 = v \cdot j, \qquad v_3 = v \cdot k.$$

Summarising, a generic vector v ∈ V admits the representation

$$v = (v \cdot i)\,i + (v \cdot j)\,j + (v \cdot k)\,k. \tag{8.15}$$

Similarly, planar vectors can be represented by

$$v = (v \cdot i)\,i + (v \cdot j)\,j$$

with respect to the orthonormal basis made by i = (1, 0) and j = (0, 1).


Proofs of some formulas above 


Proof. We start from (8.6). The equality follows from the definition of norm. The
inequality descends from the definition in case v and w are collinear; for
generic v, w instead, it states a known property of triangles, according to
which any side is shorter than the sum of the other two. In the triangle
OPR of Fig. 8.5 in fact, ‖v + w‖ = |OR|, ‖v‖ = |OP| and ‖w‖ = |PR|.
Formula (8.10) derives from expanding ‖v + w‖² using (8.7)–(8.9) as follows:

$$\begin{aligned}
\|v + w\|^2 &= (v + w) \cdot (v + w) \\
&= v \cdot v + w \cdot v + v \cdot w + w \cdot w \\
&= \|v\|^2 + 2\,v \cdot w + \|w\|^2.
\end{aligned} \tag{8.16}$$

The Cauchy-Schwarz inequality (8.11) can be proved by writing the second
of (8.6) as ‖v + w‖² ≤ (‖v‖ + ‖w‖)². For the left-hand side we use (8.16),
so that v · w ≤ ‖v‖ ‖w‖; but the latter is (8.11) in case v · w ≥ 0. When
v · w < 0, it suffices to flip the sign of v, to the effect that

$$|v \cdot w| = -v \cdot w = (-v) \cdot w \le \|{-v}\|\,\|w\| = \|v\|\,\|w\|.$$


Finally, let us prove (8.12). Suppose v and w are non-zero vectors
(for otherwise the relation is trivially satisfied by any θ). Without loss of
generality we may assume 0 ≤ θ ≤ π. Let $u = \hat{w} = \dfrac{w}{\|w\|}$ be the unit vector
corresponding to w. Then the component of v along u is

$$v_u = \frac{v \cdot w}{\|w\|}\,u. \tag{8.17}$$


Figure 8.10. Projection of v along w (the angle formed by the vectors is acute on the
left, obtuse on the right)


Suppose first that 0 < θ < π/2. In the triangle OP′P (Fig. 8.10, left)
$\|v_u\| = |OP'| = |OP| \cos\theta = \|v\| \cos\theta$; as $v_u$ has the same orientation as
u, we have

$$v_u = \|v\| \cos\theta\; u. \tag{8.18}$$

If θ is obtuse instead, in Fig. 8.10, right, we have $\|v_u\| = \|v\| \cos(\pi - \theta) = -\|v\| \cos\theta$;
precisely because now $v_u$ has opposite sign to u, (8.18) still
holds. In the remaining cases θ = 0, π/2, π it is not hard to reach the
same conclusion. Comparing (8.17) and (8.18), and noting that λv = μv means
λ = μ if v ≠ 0, we finally get to

$$\frac{v \cdot w}{\|w\|} = \|v\| \cos\theta,$$

whence (8.12). □


8.2.3 General vectors 


Many applications involve vectors at points different from the origin, like forces in 
physics acting on a point-particle. The general notion of vector can be defined as 
follows. 

Let v be a non-zero position vector of components $(v_1, v_2)$, and $P_0$ an arbitrary
point of the plane, with coordinates $(x_{01}, x_{02})$. Define $P_1$ by the coordinates
$(x_{11}, x_{12}) = (x_{01} + v_1, x_{02} + v_2)$, as in Fig. 8.11. The line segment $P_0P_1$ from $P_0$
to $P_1$ is parallel to v and has the same orientation. We say that it represents
the vector v at $P_0$, and we write $(P_0, v)$. Vice versa, given any segment going
from $P_0 = (x_{01}, x_{02})$ to $P_1 = (x_{11}, x_{12})$, we define the vector v of components
$(v_1, v_2) = (x_{11} - x_{01}, x_{12} - x_{02})$. The segment identifies the vector v at $P_0$.

A general vector in the plane is, mathematically speaking, a pair $(P_0, v)$ whose
first component is a point $P_0$ of the plane, and whose second component is a
position vector v. Normally though, and from now onwards, the vector $(P_0, v)$
shall be denoted simply by v; we will make the initial point $P_0$ explicit only if
necessary. Analogous considerations are valid for vectors in space.


Figure 8.11. The position vector v and the same vector at $P_0$


The operations on (position) vectors introduced earlier carry over to vectors
with the same initial point. The vectors $(P_0, v)$ and $(P_0, w)$ add up to
$(P_0, v) + (P_0, w)$, equal to $(P_0, v + w)$ by definition. Operations between vectors at different
points are not defined, at least in this context.


8.3 Complex numbers 


As is well known, not every algebraic equation

$$p(x) = 0$$

(p being a polynomial of degree n in x) admits solutions in the field of real numbers.
The simplest example is given by p(x) = x² + 1, i.e., the equation

$$x^2 = -1. \tag{8.19}$$

This would prescribe taking the square root of the negative number −1, and it is
well known that this is not possible in ℝ. The same happens for the generic quadratic
equation

$$ax^2 + bx + c = 0 \tag{8.20}$$

when the discriminant Δ = b² − 4ac is less than zero. The existence of solutions of
algebraic equations needs to be guaranteed both in pure and applied Mathematics.
This apparent deficiency of real numbers is overcome by enlarging ℝ to a set,
called complex numbers, where adding and multiplying preserve the same formal
properties of the reals. Obviously, defining this extension-of-sorts so as to contain
the roots of every possible algebraic equation might seem daunting. The good
news is that considering equation (8.19) only is sufficient in order to solve any
algebraic equation, due to a crucial and deep result that goes under the name of
Fundamental Theorem of Algebra.


8.3.1 Algebraic operations 


A complex number z can be defined as an ordered pair z = (x, y) of real numbers
x, y. As such, the set of complex numbers ℂ can be identified with ℝ². The reals
x and y are the real part and the imaginary part of z

$$x = \mathrm{Re}\,z \qquad \text{and} \qquad y = \mathrm{Im}\,z$$

respectively. The subset of complex numbers of the form (x, 0) is identified with
ℝ, and with this in mind one is entitled to write ℝ ⊂ ℂ. Complex numbers of the
form (0, y) are called purely imaginary.

Two complex numbers $z_1 = (x_1, y_1)$, $z_2 = (x_2, y_2)$ are equal if they have
coinciding real and imaginary parts

$$z_1 = z_2 \quad \Longleftrightarrow \quad x_1 = x_2 \ \text{ and } \ y_1 = y_2.$$


Over ℂ, we define the sum and product of two numbers by

$$z_1 + z_2 = (x_1, y_1) + (x_2, y_2) = (x_1 + x_2,\; y_1 + y_2) \tag{8.21}$$
$$z_1 z_2 = (x_1, y_1)\,(x_2, y_2) = (x_1 x_2 - y_1 y_2,\; x_1 y_2 + x_2 y_1). \tag{8.22}$$

Notice

$$(x, 0) + (0, y) = (x, y), \qquad (0, 1)\,(y, 0) = (0, y),$$

so

$$(x, y) = (x, 0) + (0, 1)\,(y, 0). \tag{8.23}$$

Moreover, (8.21) and (8.22) are old acquaintances when restricted to the reals:

$$(x_1, 0) + (x_2, 0) = (x_1 + x_2, 0) \qquad \text{and} \qquad (x_1, 0)\,(x_2, 0) = (x_1 x_2, 0).$$


In this sense complex numbers are a natural extension of real numbers.
Introduce the symbol i to denote the purely imaginary number (0, 1). By identifying
(r, 0) with the real number r, (8.23) reads

$$z = x + iy,$$

called Cartesian form or algebraic form of z = (x, y).
Observe that

$$i^2 = (0, 1)\,(0, 1) = (-1, 0) = -1,$$

so the complex number i is a root of equation (8.19). The sum (8.21) and multiplication
(8.22) of complex numbers in Cartesian form become

$$z_1 + z_2 = (x_1 + i y_1) + (x_2 + i y_2) = x_1 + x_2 + i\,(y_1 + y_2), \tag{8.24}$$
$$z_1 z_2 = (x_1 + i y_1)\,(x_2 + i y_2) = x_1 x_2 - y_1 y_2 + i\,(x_1 y_2 + x_2 y_1). \tag{8.25}$$

The recipe is to use the familiar rules of algebra, taking the relation i² = −1 into
account.
The next list of properties is left to the reader to check: 


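$$z_1 + z_2 = z_2 + z_1, \qquad z_1 z_2 = z_2 z_1,$$
$$(z_1 + z_2) + z_3 = z_1 + (z_2 + z_3), \qquad (z_1 z_2)\,z_3 = z_1\,(z_2 z_3),$$
$$z_1\,(z_2 + z_3) = z_1 z_2 + z_1 z_3$$

for any $z_1, z_2, z_3 \in \mathbb{C}$. The numbers 0 = (0, 0) and 1 = (1, 0) are the additive and
multiplicative units respectively, because

$$z + 0 = 0 + z = z \qquad \text{and} \qquad z\,1 = 1\,z = z, \qquad \forall z \in \mathbb{C}.$$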


The opposite or negative of z = (x, y) is the complex number −z = (−x, −y);
in fact z + (−z) = 0. With this we can define, for any $z_1, z_2 \in \mathbb{C}$, the difference:

$$z_1 - z_2 = z_1 + (-z_2)$$

or, equivalently,

$$x_1 + i y_1 - (x_2 + i y_2) = x_1 - x_2 + i\,(y_1 - y_2).$$

The inverse or reciprocal of a complex number z ≠ 0, denoted $\frac{1}{z}$ or $z^{-1}$, is given
by the relation $z z^{-1} = 1$, and it is easy to see

$$\frac{1}{z} = z^{-1} = \frac{x}{x^2 + y^2} + i\,\frac{-y}{x^2 + y^2}.$$

The formula

$$\frac{z_1}{z_2} = z_1 z_2^{-1} = \frac{x_1 x_2 + y_1 y_2}{x_2^2 + y_2^2} + i\,\frac{x_2 y_1 - x_1 y_2}{x_2^2 + y_2^2}$$

defines the ratio or quotient of $z_1, z_2 \in \mathbb{C}$ with $z_2 \ne 0$.
At last, let us remark that the ordering of real numbers cannot be extended to
ℂ in any way that preserves the compatibility properties of Sect. 1.3.1.
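
As an illustration, the definitions (8.21) and (8.22), together with the formulas for the
reciprocal and the quotient, can be transcribed almost literally into code. The Python
sketch below assumes complex numbers are stored as pairs of floats; the function names
`add`, `mul`, `inverse` and `div` are hypothetical choices made here, and Python's built-in
`complex` type would serve only as an independent check.

```python
def add(z1, z2):
    """Sum of two complex numbers stored as pairs, following (8.21)."""
    (x1, y1), (x2, y2) = z1, z2
    return (x1 + x2, y1 + y2)

def mul(z1, z2):
    """Product of two complex numbers stored as pairs, following (8.22)."""
    (x1, y1), (x2, y2) = z1, z2
    return (x1 * x2 - y1 * y2, x1 * y2 + x2 * y1)

def inverse(z):
    """Reciprocal of a non-zero complex number (x, y)."""
    x, y = z
    d = x * x + y * y
    if d == 0:
        raise ZeroDivisionError("the number 0 has no reciprocal")
    return (x / d, -y / d)

def div(z1, z2):
    """Quotient z1 / z2, computed as z1 times the reciprocal of z2."""
    return mul(z1, inverse(z2))

i = (0.0, 1.0)
i_squared = mul(i, i)                    # the pair representing -1
product = mul((1.0, 2.0), (3.0, -4.0))   # (1 + 2i)(3 - 4i)
```

Multiplying the pair (0, 1) by itself reproduces the relation i² = −1 that motivates the
whole construction.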


8.3.2 Cartesian coordinates 


With the identification of ℂ and ℝ², it becomes natural to associate the number
z = (x, y) = x + iy to the point of coordinates x and y in the Cartesian plane
(Fig. 8.12). The point z can also be thought of as the position vector having end
point at z. The horizontal axis of the plane is called real axis and the vertical
axis imaginary axis. For any $z_1, z_2 \in \mathbb{C}$ the sum $z_1 + z_2$ corresponds to the
vector obtained by the parallelogram rule (as in Fig. 8.13, left), while $z_1 - z_2$ is
represented by the difference vector (same figure, right).


Figure 8.12. Cartesian coordinates of the complex number z = x + iy


The modulus (or absolute value) of z = x + iy, denoted |z|, is the non-negative
number

$$|z| = \sqrt{x^2 + y^2}$$

representing the distance of (x, y) from the origin; non-incidentally, this definition
is the same as that of norm of the vector v associated to z, |z| = ‖v‖. Moreover, if
a complex number is real, its modulus is the absolute value as of Sect. 1.3.1. This
justifies the choice of name, and explains why the absolute value of a real number is
sometimes called modulus. We point out that, whereas the statement $z_1 < z_2$ has
no meaning, the inequality $|z_1| < |z_2|$ does: it says that the point (corresponding to) $z_1$
is closer to the origin than the point $z_2$. The distance of the points corresponding
to $z_1$ and $z_2$ is $|z_1 - z_2|$.
Given z ∈ ℂ, the following are easy:

$$|z| \ge 0; \qquad |z| = 0 \ \text{ if and only if } \ z = 0;$$
$$|z|^2 = (\mathrm{Re}\,z)^2 + (\mathrm{Im}\,z)^2;$$
$$\mathrm{Re}\,z \le |\mathrm{Re}\,z| \le |z|, \qquad \mathrm{Im}\,z \le |\mathrm{Im}\,z| \le |z|;$$
$$\big|\,|z_1| - |z_2|\,\big| \le |z_1 + z_2| \le |z_1| + |z_2|.$$


Figure 8.13. Sum (left) and difference (right) of complex numbers



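The complex conjugate, or just conjugate, of z = x + iy is the complex
number

$$\bar{z} = x - iy.$$

On the plane, the conjugate $\bar{z}$ is the point (x, −y) obtained by reflection of (x, y)
with respect to the real axis. The following properties hold for any $z, z_1, z_2 \in \mathbb{C}$:

$$\begin{gathered}
\bar{\bar{z}} = z, \qquad |\bar{z}| = |z|, \qquad z\,\bar{z} = |z|^2, \\
\overline{z_1 + z_2} = \bar{z}_1 + \bar{z}_2, \qquad \overline{z_1 - z_2} = \bar{z}_1 - \bar{z}_2, \\
\overline{z_1 z_2} = \bar{z}_1\,\bar{z}_2, \qquad \overline{\left(\frac{z_1}{z_2}\right)} = \frac{\bar{z}_1}{\bar{z}_2} \quad (z_2 \ne 0).
\end{gathered} \tag{8.27}$$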
Of immediate proof is also

$$\mathrm{Re}\,z = \frac{z + \bar{z}}{2}, \qquad \mathrm{Im}\,z = \frac{z - \bar{z}}{2i}$$

for all z ∈ ℂ.
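
A quick numerical spot check of some of these properties, using Python's built-in
`complex` type, may help fix ideas. This is only a sketch on two arbitrarily chosen
sample values (the names `z1`, `z2` and the numbers are assumptions made here), and a
check on samples is of course no substitute for the proofs.

```python
z1, z2 = complex(3, -4), complex(-1, 2)

# modulus as the distance from the origin: sqrt(3**2 + (-4)**2)
modulus = abs(z1)

# the conjugate of a product against the product of the conjugates
conj_of_product = (z1 * z2).conjugate()
product_of_conj = z1.conjugate() * z2.conjugate()

# the two-sided triangle inequality | |z1| - |z2| | <= |z1 + z2| <= |z1| + |z2|
lower = abs(abs(z1) - abs(z2))
middle = abs(z1 + z2)
upper = abs(z1) + abs(z2)
```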


8.3.3 Trigonometric and exponential form


Let r and θ be the polar coordinates of the point (x, y). Since

$$x = r \cos\theta \qquad \text{and} \qquad y = r \sin\theta,$$

the number z = (x, y) has a polar form, also called trigonometric,

$$z = r\,(\cos\theta + i \sin\theta). \tag{8.28}$$

First of all, r = |z|. The number θ, denoted by θ = arg z, is called the argument of
z (less often, but to some more suggestively, ‘amplitude’). Geometrically arg z is
an angle (in radians) delimited by the positive real axis and the direction of the
position vector z (as in Fig. 8.14).


Figure 8.14. Polar coordinates of the number z = x + iy



The argument can assume infinitely many values, all differing by integer multiples
of 2π. One calls principal value of arg z, and denotes by the capitalised
symbol Arg z, the unique value θ of arg z such that −π < θ ≤ π; the principal
value is defined analytically by (8.2).

Two complex numbers $z_1 = r_1(\cos\theta_1 + i \sin\theta_1)$ and $z_2 = r_2(\cos\theta_2 + i \sin\theta_2)$
are equal if and only if $r_1 = r_2$ and $\theta_1, \theta_2$ differ by an integer multiple of 2π.

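The representation in polar form is useful to multiply complex numbers, and
consequently, to compute powers and nth roots. Let in fact

$$z_1 = r_1\,(\cos\theta_1 + i \sin\theta_1) \qquad \text{and} \qquad z_2 = r_2\,(\cos\theta_2 + i \sin\theta_2);$$

the addition formulas for trigonometric functions tell us that

$$\begin{aligned}
z_1 z_2 &= r_1 r_2 \big[(\cos\theta_1 \cos\theta_2 - \sin\theta_1 \sin\theta_2) + i\,(\sin\theta_1 \cos\theta_2 + \sin\theta_2 \cos\theta_1)\big] \\
&= r_1 r_2 \big[\cos(\theta_1 + \theta_2) + i \sin(\theta_1 + \theta_2)\big].
\end{aligned} \tag{8.29}$$

Therefore

$$\arg(z_1 z_2) = \arg z_1 + \arg z_2. \tag{8.30}$$

Note that this identity is false when using Arg: take for instance
$z_1 = -1 = \cos\pi + i \sin\pi$ and $z_2 = i = \cos\frac{\pi}{2} + i \sin\frac{\pi}{2}$, so

$$z_1 z_2 = -i = \cos\left(-\frac{\pi}{2}\right) + i \sin\left(-\frac{\pi}{2}\right),$$

i.e.,

$$\mathrm{Arg}\,z_1 = \pi, \qquad \mathrm{Arg}\,z_2 = \frac{\pi}{2}, \qquad \mathrm{Arg}\,z_1 + \mathrm{Arg}\,z_2 = \frac{3}{2}\pi \ne \mathrm{Arg}\,z_1 z_2 = -\frac{\pi}{2}.$$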
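
The failure of (8.30) for principal values can be observed numerically. The sketch below
relies on `cmath.phase` from the Python standard library, which returns an argument in
[−π, π] and agrees with Arg for the numbers used here; the wrapper name `principal_arg`
is a hypothetical choice.

```python
import cmath
import math

def principal_arg(z):
    """Principal value of the argument of a non-zero complex number."""
    if z == 0:
        raise ValueError("the argument of 0 is not defined")
    return cmath.phase(z)

z1, z2 = complex(-1, 0), complex(0, 1)

sum_of_args = principal_arg(z1) + principal_arg(z2)   # pi + pi/2
arg_of_product = principal_arg(z1 * z2)               # principal argument of -i
difference = sum_of_args - arg_of_product             # an integer multiple of 2*pi
```

The two quantities differ by exactly one full turn, which is consistent with (8.30) read
as an identity between arguments defined up to multiples of 2π.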


The so-called exponential form is also useful. To define it, let us extend
the exponential function to the case where the exponent is purely imaginary, by
putting

$$e^{i\theta} = \cos\theta + i \sin\theta \tag{8.31}$$

for any θ ∈ ℝ. Such a relation is sometimes called Euler formula, and can be
actually proved within the theory of series over the complex numbers. We shall
take it as definition without further mention. The expression (8.28) now becomes

$$z = r\,e^{i\theta}, \tag{8.32}$$

the exponential form of z. The complex conjugate of z is

$$\bar{z} = r\,(\cos\theta - i \sin\theta) = r\,(\cos(-\theta) + i \sin(-\theta)) = r\,e^{-i\theta}$$

in exponential form.
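Then (8.29) immediately furnishes the product of $z_1 = r_1 e^{i\theta_1}$ and $z_2 = r_2 e^{i\theta_2}$

$$z_1 z_2 = r_1 r_2\,e^{i(\theta_1 + \theta_2)}. \tag{8.33}$$

Thus the moduli are multiplied, the arguments added. To divide complex numbers,
note that (8.29) gives, with $r_1 = r_2 = 1$,

$$e^{i\theta_1} e^{i\theta_2} = e^{i(\theta_1 + \theta_2)}. \tag{8.34}$$

In particular,

$$e^{i\theta} e^{-i\theta} = 1,$$

so $e^{-i\theta}$ is the inverse of $e^{i\theta}$. The reciprocal of $z = r e^{i\theta} \ne 0$ is then

$$z^{-1} = \frac{1}{r}\,e^{-i\theta}. \tag{8.35}$$

Combining this formula with the product one shows that the ratio of $z_1 = r_1 e^{i\theta_1}$
and $z_2 = r_2 e^{i\theta_2} \ne 0$ is

$$\frac{z_1}{z_2} = \frac{r_1}{r_2}\,e^{i(\theta_1 - \theta_2)}. \tag{8.36}$$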
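

8.3.4 Powers and nth roots


Re-iterating (8.33) and (8.35) we obtain, for any n ∈ ℤ,

$$z^n = r^n e^{in\theta}. \tag{8.37}$$

For r = 1, this is the so-called De Moivre’s formula

$$(\cos\theta + i \sin\theta)^n = \cos n\theta + i \sin n\theta. \tag{8.38}$$

By (8.37) we can calculate nth roots of a complex number. Fix n ≥ 1 and a
complex number $w = \rho\,e^{i\varphi}$, and let us determine the numbers $z = r e^{i\theta}$ such that
$z^n = w$. Relation (8.37) implies

$$z^n = r^n e^{in\theta} = \rho\,e^{i\varphi} = w,$$

which means

$$r^n = \rho, \qquad n\theta = \varphi + 2k\pi, \quad k \in \mathbb{Z},$$

hence

$$r = \sqrt[n]{\rho}, \qquad \theta = \frac{\varphi + 2k\pi}{n}, \quad k \in \mathbb{Z}.$$
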
The expression of θ does not necessarily give the principal values of the roots’
arguments. Nevertheless, as sine and cosine are periodic, we have n distinct solutions

$$z_k = \sqrt[n]{\rho}\;e^{i\frac{\varphi + 2k\pi}{n}}, \qquad k = 0, 1, \ldots, n - 1,$$

to the problem. These points lie on the circle centred at the origin with radius
$\sqrt[n]{\rho}$; they are precisely the vertices of a regular polygon of n sides (an ‘n-gon’, see
Fig. 8.15).


Figure 8.15. The point $1 + \sqrt{3}\,i$ and its fifth roots $z_j$, j = 0, ..., 4
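
The recipe for nth roots (take the nth root of the modulus, divide the argument by n, and
add multiples of 2π/n) can be sketched in Python with the standard `cmath` module. The
function name `nth_roots` is a hypothetical choice, and the sample value $1 + \sqrt{3}\,i$ with
n = 5 is the situation of Fig. 8.15; results hold up to rounding.

```python
import cmath
import math

def nth_roots(w, n):
    """Return the n distinct nth roots of a non-zero complex number w."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    rho, phi = abs(w), cmath.phase(w)   # modulus and one value of the argument
    r = rho ** (1.0 / n)                # common modulus of all the roots
    return [cmath.rect(r, (phi + 2 * k * math.pi) / n) for k in range(n)]

w = complex(1, math.sqrt(3))
roots = nth_roots(w, 5)
```

Every element of `roots` has the same modulus, and consecutive arguments differ by 2π/5,
which is the regular pentagon described in the text.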


Examples 8.5 


i) For n ≥ 1 consider the equation

$$z^n = 1.$$

Writing $1 = 1\,e^{i0}$ we obtain the n distinct roots

$$z_k = e^{i\frac{2k\pi}{n}}, \qquad k = 0, 1, \ldots, n - 1,$$

called nth roots of unity. When n is odd, only one of these is real, $z_0 = 1$, whilst
for n even there are two real roots of unity, $z_0 = 1$ and $z_{n/2} = -1$ (Fig. 8.16).

ii) Verify that

$$z^2 = -1$$

admits, as it should, the solutions $z_\pm = \pm i$. Write $-1 = 1\,e^{i\pi}$, from which

$$z_+ = z_0 = e^{i\frac{\pi}{2}} = i \qquad \text{and} \qquad z_- = z_1 = e^{i\frac{\pi + 2\pi}{2}} = e^{-i\frac{\pi}{2}} = -i.$$


Note finally that (8.31) permits one to define the exponential of arbitrary (not only
imaginary) complex numbers z = x + iy, by letting

$$e^z = e^x e^{iy} = e^x (\cos y + i \sin y). \tag{8.39}$$


Figure 8.16. Roots of unity: cubic roots (left) and sixth roots (right)


Using (8.34) it is now an easy task to verify that the fundamental relation
$e^{z_1 + z_2} = e^{z_1} e^{z_2}$ is still valid in the realm of complex numbers. In addition to that,

$$|e^z| = e^{\mathrm{Re}\,z} > 0, \qquad \arg e^z = \mathrm{Im}\,z.$$

The first tells, amongst other things, that $e^z \ne 0$ for any z ∈ ℂ. The periodicity
of the trigonometric functions implies

$$e^{z + 2k\pi i} = e^z \qquad \text{for all } k \in \mathbb{Z}.$$
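
These properties of the complex exponential are easy to probe numerically. The sketch
below assumes the definition $e^z = e^x(\cos y + i \sin y)$ for z = x + iy; the function name
`cexp` and the sample values are hypothetical, and the library routine `cmath.exp` could
be used as an independent reference.

```python
import math

def cexp(z):
    """Complex exponential e^z = e^x (cos y + i sin y) for z = x + iy."""
    return math.exp(z.real) * complex(math.cos(z.imag), math.sin(z.imag))

z1, z2 = complex(0.5, 2.0), complex(-1.0, 0.75)

product_rule_gap = abs(cexp(z1 + z2) - cexp(z1) * cexp(z2))   # tiny, up to rounding
modulus_gap = abs(abs(cexp(z1)) - math.exp(z1.real))          # tiny, up to rounding
period_gap = abs(cexp(z1 + 3 * 2j * math.pi) - cexp(z1))      # tiny, up to rounding
```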


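8.3.5 Algebraic equations


We will show that the quadratic equation with real coefficients

$$az^2 + bz + c = 0$$

admits two complex-conjugate solutions in case the discriminant Δ is negative.
We can suppose a > 0. Inspired by the square of a binomial we write

$$0 = z^2 + \frac{b}{a}\,z + \frac{c}{a} = \left(z^2 + 2\,\frac{b}{2a}\,z + \frac{b^2}{4a^2}\right) + \frac{c}{a} - \frac{b^2}{4a^2},$$

that is

$$\left(z + \frac{b}{2a}\right)^2 - \frac{\Delta}{4a^2} = 0.$$

Therefore

$$z + \frac{b}{2a} = \pm\, i\,\frac{\sqrt{-\Delta}}{2a},$$

or

$$z = \frac{-b \pm i \sqrt{-\Delta}}{2a}.$$

We write this as $z = \dfrac{-b \pm \sqrt{\Delta}}{2a}$, in analogy to the case Δ ≥ 0.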


The procedure may be applied when the coefficients a ≠ 0, b and c are complex
numbers, as well. Thus

$$z = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}$$

are the two solutions of the equation az² + bz + c = 0 in the greatest possible
generality.
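
The quadratic formula over ℂ translates directly into code. The following Python sketch
uses `cmath.sqrt`, which returns one of the two complex square roots (the other being its
opposite); the function name `solve_quadratic` and the sample coefficients are hypothetical.

```python
import cmath

def solve_quadratic(a, b, c):
    """Return the two roots of a z^2 + b z + c = 0, with real or complex coefficients."""
    if a == 0:
        raise ValueError("the leading coefficient must be non-zero")
    root = cmath.sqrt(b * b - 4 * a * c)   # one square root of the discriminant
    return (-b + root) / (2 * a), (-b - root) / (2 * a)

# Real coefficients with negative discriminant: a pair of complex-conjugate roots
z_plus, z_minus = solve_quadratic(1, 2, 5)

# Complex coefficients are handled by the same formula
w_plus, w_minus = solve_quadratic(complex(1, 1), complex(2, -1), complex(0, 3))
```

For z² + 2z + 5 = 0 the discriminant is −16, so the roots are −1 ± 2i, a conjugate pair as
predicted.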

Third- and fourth-degree algebraic equations have three and four solutions
respectively (counted with multiplicity): these roots can be made explicit via algebraic
operations, namely square and cubic roots¹. There can be instead no general formula
of this kind for solving an equation of fifth degree or higher. Despite all this, the
Fundamental Theorem of Algebra warrants that every algebraic equation p(z) = 0,
where p is a polynomial of degree n with real or complex coefficients, admits
exactly n solutions in ℂ, each counted with its multiplicity. This is how it goes.


Theorem 8.6 Let $p(z) = a_n z^n + \ldots + a_1 z + a_0$, with $a_n \ne 0$, be a polynomial
of degree n with coefficients $a_k \in \mathbb{C}$, 0 ≤ k ≤ n. There exist m ≤ n distinct
complex numbers $z_1, \ldots, z_m$, and m non-zero natural numbers $\mu_1, \ldots, \mu_m$ with
$\mu_1 + \ldots + \mu_m = n$, such that p(z) factorises as

$$p(z) = a_n\,(z - z_1)^{\mu_1} \cdots (z - z_m)^{\mu_m}.$$


The numbers $z_k$ are the roots of the polynomial p, in other words the solutions of
p(z) = 0; the exponent $\mu_k$ is the multiplicity of the root $z_k$. A root is simple if it
has multiplicity one, double if the multiplicity is 2, and so on.

It is worth remarking that if the coefficients of p are real and if $z_0$ is a
complex root, then also $\bar{z}_0$ is a root of p. In fact, taking conjugates of $p(z_0) = 0$
and using known properties (see (8.27)), we obtain

$$0 = \bar{0} = \overline{p(z_0)} = \bar{a}_n \bar{z}_0^{\,n} + \ldots + \bar{a}_1 \bar{z}_0 + \bar{a}_0 = a_n \bar{z}_0^{\,n} + \ldots + a_1 \bar{z}_0 + a_0 = p(\bar{z}_0).$$

The polynomial p(z) is then divisible by $(z - z_0)(z - \bar{z}_0)$, a quadratic polynomial
with real coefficients.

A version of the Fundamental Theorem of Algebra for real polynomials, that 
does not involve complex numbers, is stated in Theorem 9.15. 


¹ The cubic equation x³ + ax² + bx + c = 0 for example, reduces to the form y³ + py + q = 0,
by changing $x = y - \frac{a}{3}$; p and q are suitable coefficients, which are easy to find. The
solutions of the reduced equation read

$$y = \sqrt[3]{-\frac{q}{2} + \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}} + \sqrt[3]{-\frac{q}{2} - \sqrt{\frac{q^2}{4} + \frac{p^3}{27}}},$$

a formula due to Cardano. Extracting a root yields as many solutions as the order of
the root (here 2 or 3), yielding a maximum of 12 solutions, at least in principle: it is
possible to prove that at most 3 of them are distinct.




8.4 Curves in the plane and in space 


The second part of the chapter sees the return of functions, and the present section 
devotes itself in particular to the notion of a curve in Euclidean space or on the 
plane. A curve can describe the boundary of a planar region such as a polygon, or 
an ellipse; it is a good model for the trajectory of a point-particle moving in time 
under the effect of a force. In Chap. 10 we shall see how to perform integral calculus
along curves, which enables one to describe mathematically the notion of work, to stay
with the physical analogy.


Let I be an arbitrary interval of the real line and γ : I → ℝ³ a map. Denote
by γ(t) = (x(t), y(t), z(t)) the point of ℝ³ image of t ∈ I under γ. One says γ is a
continuous map on I if the components x, y, z : I → ℝ are continuous functions.


Definition 8.7 A continuous map γ : I ⊆ ℝ → ℝ³ is called a curve (in
space). The range of the map is called image and will be denoted by the letter
C = γ(I) ⊆ ℝ³.


If the image lies on a plane, one talks about a plane curve. A special case is that
where γ(t) = (x(t), y(t), 0), that is, curves lying in the xy-plane which we indicate
simply as γ : I → ℝ², γ(t) = (x(t), y(t)).

Thus a curve is a function of one real variable, whereas the image is a subset of
space (or the plane). Curves furnish a way to parametrise their image by associating
to each value of the parameter t ∈ I exactly one point. The set C could be the
image of many curves, by different parametrisations. For example, the plane curve
γ(t) = (t, t) with t ∈ [0, 1] has the segment with endpoints A = (0, 0), B = (1, 1)
as image. But this is also the image of δ(t) = (t², t²), t ∈ [0, 1]; the two curves
γ and δ are parametrisations of the segment AB. The middle point of AB is for
example image of $t = \frac{1}{2}$ under γ and $t = \frac{\sqrt{2}}{2}$ under δ.


Figure 8.17. Clockwise from top left: images C = γ([a, b]) of a simple arc, a non-simple
arc, a closed arc which is not simple, a Jordan arc
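
The distinction between a curve and its image can be made concrete with a few lines of
Python. In this sketch the two parametrisations of the segment AB are written as ordinary
functions; the names `gamma` and `delta` mirror the text, while the sampling grid is an
arbitrary choice made here.

```python
import math

def gamma(t):
    """First parametrisation of the segment AB: t -> (t, t), t in [0, 1]."""
    return (t, t)

def delta(t):
    """Second parametrisation of the same segment: t -> (t^2, t^2), t in [0, 1]."""
    return (t * t, t * t)

# The midpoint of AB is reached at different parameter values
mid_gamma = gamma(0.5)
mid_delta = delta(math.sqrt(2) / 2)

# Sampling both maps on [0, 1]: every image point lies on the line y = x, 0 <= x <= 1
samples = [k / 10 for k in range(11)]
on_segment = all(
    x == y and 0.0 <= x <= 1.0
    for curve in (gamma, delta)
    for (x, y) in (curve(t) for t in samples)
)
```

Both functions trace the same set of points, but they reach a given point at different
values of the parameter, so they are different curves.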

A curve γ is simple if γ is a one-to-one map, i.e., if different values of the
parameter determine distinct points on the image.

Suppose the interval I = [a, b] is closed and bounded, as in the previous examples,
in which case the curve γ is called an arc. An arc is closed if γ(a) = γ(b);
clearly a closed arc is not simple. Nevertheless, one defines simple closed arc (or
Jordan arc) a closed arc which is simple except for one single point γ(a) = γ(b).
Fig. 8.17 illustrates various types of situations.

The reader might encounter the word arc in the literature (as in ‘arc of circumference’)
to denote a subset of ℝ² or ℝ³, endowed with the most natural (hence
implicitly understood) parametrisation.


Examples 8.8 
i) The simple plane curve

$$\gamma(t) = (at + b,\; ct + d), \qquad t \in \mathbb{R}, \quad a \ne 0,$$

has for an image the line $y = \dfrac{c}{a}\,x + \dfrac{ad - bc}{a}$. Setting x = x(t) = at + b and
y = y(t) = ct + d, in fact, gives $t = \dfrac{x - b}{a}$, so

$$y = \frac{c}{a}\,(x - b) + d = \frac{c}{a}\,x + \frac{ad - bc}{a}.$$

ii) The curve

$$\gamma(t) = (x(t), y(t)) = (1 + \cos t,\; 3 + \sin t), \qquad t \in [0, 2\pi],$$

has the circle centred at (1, 3) with radius 1 as image; in fact
$(x(t) - 1)^2 + (y(t) - 3)^2 = \cos^2 t + \sin^2 t = 1$. This is a simple closed curve and provides the
most natural way to parametrise the circle that starts at (2, 3) and runs in the
counter-clockwise direction.

In general, the image of the Jordan curve

$$\gamma(t) = (x(t), y(t)) = (x_0 + r \cos t,\; y_0 + r \sin t), \qquad t \in [0, 2\pi],$$

is the circle with centre $(x_0, y_0)$ and radius r.

If t varies in an interval [0, 2kπ], with k ≥ 2 a positive integer, the curve has the
same image seen as a set; but because we wind around the centre k times, the
curve is not simple.

Instead, if t varies in [0, π], the curve is an arc of circumference, simple but not
closed.
iii) Given a, b > 0, the map

$$\gamma(t) = (x(t), y(t)) = (a \cos t,\; b \sin t), \qquad t \in [0, 2\pi],$$

is a simple closed curve parametrising the ellipse with centre in the origin and
semi-axes a and b.

iv) The image of

$$\gamma(t) = (x(t), y(t)) = (t \cos t,\; t \sin t), \qquad t \in [0, +\infty),$$

is drawn in Fig. 8.18 (left); the spiral coils counter-clockwise around the origin.
The generic point γ(t) has distance $\sqrt{x^2(t) + y^2(t)} = t$ from the origin, so it
moves always farther as t grows, making the spiral a simple curve.


Figure 8.18. The spiral and helix of Examples 8.8 iv), vi)

v) Let $P = (x_P, y_P, z_P)$ and $Q = (x_Q, y_Q, z_Q)$ be distinct points in space. The
image of the simple curve

$$\gamma(t) = P + (Q - P)\,t, \qquad t \in \mathbb{R},$$

is the straight line through P and Q, because γ(0) = P, γ(1) = Q and the vector
γ(t) − P has constant direction, being parallel to Q − P.

The same line can be parametrised more generally by

$$\gamma(t) = P + (Q - P)\,\frac{t - t_0}{t_1 - t_0}, \qquad t \in \mathbb{R}, \tag{8.40}$$

where $t_0 \ne t_1$; in this case $\gamma(t_0) = P$, $\gamma(t_1) = Q$.

vi) Consider the simple curve

$$\gamma(t) = (x(t), y(t), z(t)) = (\cos t,\; \sin t,\; t), \qquad t \in \mathbb{R}.$$

Its image is the circular helix (Fig. 8.18, right) resting on the infinite cylinder
along the z-axis with radius one, i.e., the set $\{(x, y, z) \in \mathbb{R}^3 : x^2 + y^2 = 1\}$. □


A curve γ : I → ℝ³ is differentiable if the components x, y, z : I → ℝ are
differentiable maps on I (recall that differentiable on I means differentiable at every
interior point, and differentiable on one side at the boundary, if this is included in
I). Let γ′ : I → ℝ³ be the derivative function γ′(t) = (x′(t), y′(t), z′(t)).


Figure 8.19. Tangent vector and secant at the point $P_0$


Definition 8.9 The curve γ : I → ℝ³ is regular if it is differentiable over
I with continuous derivative (i.e., the components are of class C¹ on I) and
if γ′(t) ≠ (0, 0, 0), for every t ∈ I.


A curve γ : I → ℝ³ is said to be piecewise regular if I is the union of finitely
many subintervals where γ is regular.


When the curve γ is regular and $t_0 \in I$, the vector $\gamma'(t_0)$ is called tangent
vector to (the image of) the curve at $P_0 = \gamma(t_0)$. The name comes from the
geometric picture (Fig. 8.19). Let $t_0 + \Delta t \in I$ be such that the point
$P_{\Delta t} = \gamma(t_0 + \Delta t)$ is different from $P_0$, and consider the straight line passing through $P_0$
and $P_{\Delta t}$. By (8.40) such line can be parametrised as

$$S(t) = P_0 + (P_{\Delta t} - P_0)\,\frac{t - t_0}{\Delta t} = \gamma(t_0) + \frac{\gamma(t_0 + \Delta t) - \gamma(t_0)}{\Delta t}\,(t - t_0). \tag{8.41}$$

As Δt goes to 0, the point $P_{\Delta t}$ approaches $P_0$ (component-wise). At the same time,
the regularity assumption forces the vector
$\sigma = \sigma(t_0, \Delta t) = \dfrac{\gamma(t_0 + \Delta t) - \gamma(t_0)}{\Delta t}$
to tend to $\gamma'(t_0)$. Therefore the limiting position of (8.41) is

$$T(t) = \gamma(t_0) + \gamma'(t_0)\,(t - t_0), \qquad t \in \mathbb{R},$$

the straight line tangent to the curve at $P_0$. To be very precise, the tangent vector
at $P_0$ is the vector $(P_0, \gamma'(t_0))$, but it is common to write it simply $\gamma'(t_0)$ (as
discussed in Sect. 8.2.3). One can easily verify that the tangent line to a curve
at a point is an intrinsic notion, independent of the chosen parametrisation,
whereas the tangent vector does depend on the parametrisation, as far as length
and orientation are concerned.


In kinematics, a curve represents a trajectory, i.e., the position γ(t) a particle
occupies at time t. If the curve is regular, the tangent vector γ′(t) describes the
velocity of the particle at time t.


Examples 8.10 
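i) All curves in Examples 8.8 are regular.

ii) Let f : I → ℝ be differentiable with continuity on I. The curve

$$\gamma(t) = (t, f(t)), \qquad t \in I,$$

is regular, and has image the graph of the function f. In fact,

$$\gamma'(t) = (1, f'(t)) \ne (0, 0), \qquad \text{for any } t \in I.$$

iii) The arc γ : [0, 2] → ℝ²

$$\gamma(t) = \begin{cases} (t, 1), & \text{if } t \in [0, 1), \\ (t, t), & \text{if } t \in [1, 2], \end{cases}$$

parametrises the polygonal chain ABC (Fig. 8.20, left), while

$$\gamma(t) = \begin{cases} (t, 1), & \text{if } t \in [0, 1), \\ (t, t), & \text{if } t \in [1, 2), \\ \left(4 - t,\; 2 - \frac{1}{2}(t - 2)\right), & \text{if } t \in [2, 4], \end{cases}$$

describes ABCA (Fig. 8.20, right). Both are piecewise regular curves.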

Figure 8.20. The polygonal chains ABC (left) and ABCA (right) in Example 8.10 iii)

iv) The curves

$$\gamma(t) = (1 + \sqrt{2} \cos t,\; \sqrt{2} \sin t), \qquad t \in [0, 2\pi],$$
$$\tilde{\gamma}(t) = (1 + \sqrt{2} \cos 2t,\; -\sqrt{2} \sin 2t), \qquad t \in [0, \pi],$$

parametrise the same circle C (counter-clockwise and clockwise respectively)
with centre (1, 0) and radius √2.

They are regular and differentiate to

$$\gamma'(t) = \sqrt{2}\,(-\sin t,\; \cos t), \qquad \tilde{\gamma}'(t) = 2\sqrt{2}\,(-\sin 2t,\; -\cos 2t).$$

The point $P_0 = (0, 1) \in C$ is the image under γ of $t_0 = \frac{3}{4}\pi$, under $\tilde{\gamma}$ of the
value $\tilde{t}_0 = \frac{5}{8}\pi$: $P_0 = \gamma(t_0) = \tilde{\gamma}(\tilde{t}_0)$. In the former case the tangent vector is
$\gamma'(t_0) = (-1, -1)$ and the tangent to C at $P_0$

$$T(t) = (0, 1) - (1, 1)\left(t - \tfrac{3}{4}\pi\right) = \left(-t + \tfrac{3}{4}\pi,\; 1 - t + \tfrac{3}{4}\pi\right), \qquad t \in \mathbb{R}.$$

For the latter parametrisation, $\tilde{\gamma}'(\tilde{t}_0) = (2, 2)$ and

$$\tilde{T}(t) = (0, 1) + (2, 2)\left(t - \tfrac{5}{8}\pi\right) = \left(2\left(t - \tfrac{5}{8}\pi\right),\; 1 + 2\left(t - \tfrac{5}{8}\pi\right)\right), \qquad t \in \mathbb{R}.$$

The tangent vectors at $P_0$ have different lengths and orientations, but same direction.
Recalling Example 8.8 i) in fact, in both cases the tangent line has equation
y = 1 + x. □
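
The tangent vectors of Example 8.10 iv) can be approximated by difference quotients, in
the spirit of the vector σ in the discussion leading to the tangent line. The Python sketch
below uses a symmetric (central) difference quotient, which is a numerical convenience
adopted here rather than the one-sided quotient of the text; the names `gamma`,
`gamma_tilde`, `tangent_vector` and the step size `h` are assumptions of this sketch.

```python
import math

def gamma(t):
    """Counter-clockwise parametrisation of the circle, t in [0, 2*pi]."""
    return (1 + math.sqrt(2) * math.cos(t), math.sqrt(2) * math.sin(t))

def gamma_tilde(t):
    """Clockwise parametrisation of the same circle, t in [0, pi]."""
    return (1 + math.sqrt(2) * math.cos(2 * t), -math.sqrt(2) * math.sin(2 * t))

def tangent_vector(curve, t0, h=1e-6):
    """Central difference quotient approximating the derivative of curve at t0."""
    before, after = curve(t0 - h), curve(t0 + h)
    return tuple((a - b) / (2 * h) for a, b in zip(after, before))

v = tangent_vector(gamma, 3 * math.pi / 4)              # close to (-1, -1)
v_tilde = tangent_vector(gamma_tilde, 5 * math.pi / 8)  # close to (2, 2)

# Parallel vectors in the plane have vanishing 2x2 determinant
determinant = v[0] * v_tilde[1] - v[1] * v_tilde[0]
```

The two approximations differ in length and orientation but are parallel, so they
determine the same tangent line through (0, 1).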


8.5 Functions of several variables 


The object of our investigation in the previous chapters has been functions of one
real variable, that is, maps defined on a subset of the real line ℝ (like an interval),
with values in ℝ.

We would now like to extend some of those notions and introduce new ones,
relative to real-valued functions of two or three real variables. These are defined
on subsets of ℝ² or ℝ³ and valued in ℝ

$$f : \operatorname{dom} f \subseteq \mathbb{R}^d \to \mathbb{R} \quad (d = 2 \text{ or } 3), \qquad x \mapsto f(x).$$

The symbol x indicates a generic element of $\mathbb{R}^d$, hence a pair $x = (x_1, x_2)$ if d = 2
or a triple $x = (x_1, x_2, x_3)$ if d = 3. For simplicity we might write $(x_1, x_2) = (x, y)$
and $(x_1, x_2, x_3) = (x, y, z)$, and the coordinates of x shall be $(x_1, \ldots, x_d)$ when
it is not relevant whether d = 2 or 3. Each $x \in \mathbb{R}^d$ is uniquely associated to a
point P of the plane or space, whose coordinates in an orthogonal Cartesian frame
are the components of x. In turn, P determines a position vector of components
$x_1, \ldots, x_d$, so the element $x \in \mathbb{R}^d$ can be thought of as that vector. In this way,
$\mathbb{R}^d$ inherits the operations of sum $x + y = (x_1 + y_1, \ldots, x_d + y_d)$, multiplication
$\lambda x = (\lambda x_1, \ldots, \lambda x_d)$ and dot product $x \cdot y = x_1 y_1 + \ldots + x_d y_d$. Furthermore, the
Euclidean norm $\|x\| = \sqrt{x_1^2 + \ldots + x_d^2}$ represents the Euclidean distance of P to
O. Notice $\|x - y\| = \sqrt{(x_1 - y_1)^2 + \ldots + (x_d - y_d)^2}$ is the distance between the
points P and Q of respective coordinates x and y.


8.5.1 Continuity 


By means of the norm we can define neighbourhoods of a point in $\mathbb{R}^d$ and extend
the concepts of continuity, and limit, to functions of several variables.


Definition 8.11 Let $x_0 \in \mathbb{R}^d$ and r > 0 real. The set

$$I_r(x_0) = \{x \in \mathbb{R}^d : \|x - x_0\| < r\}$$

of points of $\mathbb{R}^d$ whose distance from $x_0$ is less than r is called neighbourhood
of $x_0$ of radius r.


With $x_0 = (x_{01}, \ldots, x_{0d})$, the condition $\|x - x_0\| < r$ is equivalent to

$$\begin{aligned}
(x_1 - x_{01})^2 + (x_2 - x_{02})^2 < r^2 \quad & \text{if } d = 2, \\
(x_1 - x_{01})^2 + (x_2 - x_{02})^2 + (x_3 - x_{03})^2 < r^2 \quad & \text{if } d = 3.
\end{aligned}$$

Therefore $I_r(x_0)$ is respectively the disc or the ball centred at $x_0$ with radius r,
without boundary.


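Defining continuity is formally the same as for one real variable.

Definition 8.12 A function $f : \operatorname{dom} f \subseteq \mathbb{R}^d \to \mathbb{R}$ is continuous at
$x_0 \in \operatorname{dom} f$ if for any ε > 0 there exists δ > 0 such that

$$\forall x \in \operatorname{dom} f, \qquad \|x - x_0\| < \delta \ \Rightarrow \ |f(x) - f(x_0)| < \varepsilon.$$

Otherwise said, if

$$\forall x \in \operatorname{dom} f, \qquad x \in I_\delta(x_0) \ \Rightarrow \ f(x) \in I_\varepsilon(f(x_0)).$$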


Example 8.13 
The map f : ℝ² → ℝ, $f(x) = 2x_1 + 5x_2$ is continuous at $x_0 = (3, 1)$, for

$$|f(x) - f(x_0)| = |2(x_1 - 3) + 5(x_2 - 1)| \le 2|x_1 - 3| + 5|x_2 - 1| \le 7\,\|x - x_0\|.$$

Here we have used the fact that $|y_i| \le \|y\|$ for any i = 1, ..., d and any $y \in \mathbb{R}^d$,
a direct consequence of the definition of norm. Given ε > 0, it is sufficient to
choose δ = ε/7.

The same argument shows that f is continuous at every $x_0 \in \mathbb{R}^2$.
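
The choice δ = ε/7 can be spot-checked by sampling points in the disc of radius δ around
(3, 1). The Python sketch below is only a sanity check on randomly drawn points, not a
proof; the function name `delta_works`, the number of trials and the seed are arbitrary
assumptions.

```python
import math
import random

def f(x1, x2):
    """The map f(x) = 2*x1 + 5*x2 of the example."""
    return 2 * x1 + 5 * x2

def delta_works(eps, trials=10_000, seed=0):
    """Sample points with ||x - x0|| < eps/7 and test |f(x) - f(x0)| < eps."""
    rng = random.Random(seed)
    x0 = (3.0, 1.0)
    delta = eps / 7
    for _ in range(trials):
        radius = delta * rng.random()            # a value in [0, delta)
        phi = rng.uniform(0.0, 2.0 * math.pi)    # a random direction
        x = (x0[0] + radius * math.cos(phi), x0[1] + radius * math.sin(phi))
        if abs(f(*x) - f(*x0)) >= eps:
            return False
    return True

ok_coarse = delta_works(0.1)
ok_fine = delta_works(1e-3)
```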


A map $f : \operatorname{dom} f \subseteq \mathbb{R}^d \to \mathbb{R}$ is continuous on the region $\Omega \subseteq \operatorname{dom} f$ if it is
continuous at each point x ∈ Ω.


The limit for $x \to x_0 \in \mathbb{R}^d$ is defined in a completely similar way to the one
given in Chap. 3.


8.5.2 Partial derivatives and gradient 


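Let $f : \operatorname{dom} f \subseteq \mathbb{R}^2 \to \mathbb{R}$ be a function of two variables defined in a neighbourhood
of $\mathbf{x}_0 = (x_0, y_0)$. Now fix the second variable to obtain a map of one real variable
$x$ defined around $x_0 \in \mathbb{R}$

$$x \mapsto f(x, y_0).$$

If this is differentiable at $x_0$, one says that the function $f$ admits partial derivative
with respect to $x$ at $\mathbf{x}_0$, written

$$\frac{\partial f}{\partial x}(\mathbf{x}_0) = \left(\frac{d}{dx} f(x, y_0)\right)(x_0).$$

Similarly, if $y \mapsto f(x_0, y)$ is differentiable at $y_0$, one says that $f$ admits partial
derivative with respect to $y$ at $\mathbf{x}_0$

$$\frac{\partial f}{\partial y}(\mathbf{x}_0) = \left(\frac{d}{dy} f(x_0, y)\right)(y_0).$$

If both conditions above hold, $f$ admits (first) partial derivatives at $\mathbf{x}_0$, and therefore
the gradient vector of $f$ at $\mathbf{x}_0$ is well defined: this is denoted

$$\nabla f(\mathbf{x}_0) = \left(\frac{\partial f}{\partial x}(\mathbf{x}_0), \frac{\partial f}{\partial y}(\mathbf{x}_0)\right) \in \mathbb{R}^2.$$

In the same fashion, let $f : \operatorname{dom} f \subseteq \mathbb{R}^3 \to \mathbb{R}$ be a function of three variables
defined around $\mathbf{x}_0 = (x_0, y_0, z_0)$; the (first) partial derivatives at $\mathbf{x}_0$ with respect
to $x$, $y$, $z$ are

$$\frac{\partial f}{\partial x}(\mathbf{x}_0) = \left(\frac{d}{dx} f(x, y_0, z_0)\right)(x_0),$$
$$\frac{\partial f}{\partial y}(\mathbf{x}_0) = \left(\frac{d}{dy} f(x_0, y, z_0)\right)(y_0),$$
$$\frac{\partial f}{\partial z}(\mathbf{x}_0) = \left(\frac{d}{dz} f(x_0, y_0, z)\right)(z_0),$$

assuming implicitly that the right-hand-side terms exist. The gradient of $f$ at $\mathbf{x}_0$
is the vector

$$\nabla f(\mathbf{x}_0) = \left(\frac{\partial f}{\partial x}(\mathbf{x}_0), \frac{\partial f}{\partial y}(\mathbf{x}_0), \frac{\partial f}{\partial z}(\mathbf{x}_0)\right) \in \mathbb{R}^3.$$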




Examples 8.14 


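i) Let $f(x, y) = \sqrt{x^2 + y^2}$ be the distance function from the origin. Considering
$\mathbf{x}_0 = (2, -1)$ we have

$$\frac{\partial f}{\partial x}(2,-1) = \left(\frac{d}{dx}\sqrt{x^2+1}\right)(2) = \left.\frac{x}{\sqrt{x^2+1}}\right|_{x=2} = \frac{2}{\sqrt5},$$
$$\frac{\partial f}{\partial y}(2,-1) = \left(\frac{d}{dy}\sqrt{4+y^2}\right)(-1) = \left.\frac{y}{\sqrt{4+y^2}}\right|_{y=-1} = -\frac{1}{\sqrt5},$$

so that

$$\nabla f(2,-1) = \left(\frac{2}{\sqrt5}, -\frac{1}{\sqrt5}\right) = \frac{1}{\sqrt5}(2,-1).$$

ii) For $f(x, y, z) = y\log(2x - 3z)$ we have, at $\mathbf{x}_0 = (2, 3, 1)$,

$$\frac{\partial f}{\partial x}(2,3,1) = \left(\frac{d}{dx}\, 3\log(2x-3)\right)(2) = \left.3\,\frac{2}{2x-3}\right|_{x=2} = 6,$$
$$\frac{\partial f}{\partial y}(2,3,1) = \left(\frac{d}{dy}\, y\log 1\right)(3) = 0,$$
$$\frac{\partial f}{\partial z}(2,3,1) = \left(\frac{d}{dz}\, 3\log(4-3z)\right)(1) = \left.3\,\frac{-3}{4-3z}\right|_{z=1} = -9,$$

thus

$$\nabla f(2,3,1) = (6, 0, -9).$$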
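The two gradients of Examples 8.14 can be cross-checked with finite differences. This is a minimal sketch, assuming a symmetric difference quotient with step `h = 1e-6`; the helper names `partial` and `gradient` are hypothetical and not part of the text.

```python
import math


def partial(func, point, i, h=1e-6):
    """Symmetric difference quotient approximating the i-th partial derivative."""
    plus, minus = list(point), list(point)
    plus[i] += h
    minus[i] -= h
    return (func(*plus) - func(*minus)) / (2 * h)


def gradient(func, point, h=1e-6):
    """Approximate gradient: one partial derivative per coordinate."""
    return tuple(partial(func, point, i, h) for i in range(len(point)))


def distance(x, y):
    return math.hypot(x, y)


def g(x, y, z):
    return y * math.log(2 * x - 3 * z)


print(gradient(distance, (2.0, -1.0)))   # compare with (2/sqrt(5), -1/sqrt(5))
print(gradient(g, (2.0, 3.0, 1.0)))      # compare with (6, 0, -9)
```

Each partial derivative freezes all coordinates but one, exactly as in the definition; only the limit is replaced by a small fixed step.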
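Set $\mathbf{x} = (x_1, \ldots, x_d)$. The partial derivative of $f$ at $\mathbf{x}_0$ with respect to the
variable $x_i$, $i = 1, \ldots, d$, is often indicated also by

$$D_{x_i} f(\mathbf{x}_0) \qquad \text{or} \qquad f_{x_i}(\mathbf{x}_0).$$

The function

$$\frac{\partial f}{\partial x_i} : \mathbf{x} \mapsto \frac{\partial f}{\partial x_i}(\mathbf{x}),$$

defined on a subset $\operatorname{dom} \frac{\partial f}{\partial x_i} \subseteq \operatorname{dom} f \subseteq \mathbb{R}^d$ with values in $\mathbb{R}$, is called partial
derivative of $f$ with respect to $x_i$. The gradient function of $f$,

$$\nabla f : \mathbf{x} \mapsto \nabla f(\mathbf{x}),$$

is defined on the intersection of the domains of the partial derivatives. The gradient
is an example of a vector field, i.e., a function defined on a subset of $\mathbb{R}^d$ with values
in $\mathbb{R}^d$ (thought of as a vector space).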


Examples 8.15 


Let us look at the previous examples. 


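i) The gradient of $f(x, y) = \sqrt{x^2 + y^2}$ is

$$\nabla f(\mathbf{x}) = \left(\frac{x}{\sqrt{x^2+y^2}}, \frac{y}{\sqrt{x^2+y^2}}\right) = \frac{\mathbf{x}}{\|\mathbf{x}\|}$$

and $\operatorname{dom} \nabla f = \mathbb{R}^2 \setminus \{\mathbf{0}\}$.

ii) For the function $f(x, y, z) = y\log(2x - 3z)$ we have

$$\nabla f(\mathbf{x}) = \left(\frac{2y}{2x-3z}, \log(2x-3z), \frac{-3y}{2x-3z}\right),$$

so $\operatorname{dom} \nabla f = \operatorname{dom} f = \{(x, y, z) \in \mathbb{R}^3 : 2x - 3z > 0\}$.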


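Partial derivatives with respect to $x_i$, $i = 1, \ldots, d$, are special directional derivatives,
which we now discuss. Let $f$ be a map defined around a point $\mathbf{x}_0 \in \mathbb{R}^d$
and suppose $\mathbf{v} \in \mathbb{R}^d$ is a given non-zero vector. By definition, $f$ admits (partial)
derivative at $\mathbf{x}_0$ in the direction $\mathbf{v}$ if the quantity

$$\frac{\partial f}{\partial \mathbf{v}}(\mathbf{x}_0) = \lim_{t \to 0} \frac{f(\mathbf{x}_0 + t\mathbf{v}) - f(\mathbf{x}_0)}{t}$$

exists and is finite. Another name is directional derivative along $\mathbf{v}$, written
$D_{\mathbf{v}} f(\mathbf{x}_0)$.

The condition expresses the differentiability at $t_0 = 0$ of the map $t \mapsto f(\mathbf{x}_0 + t\mathbf{v})$
defined around $t_0$ (because if $t$ is small enough, $\mathbf{x}_0 + t\mathbf{v}$ is in the neighbourhood of
$\mathbf{x}_0$ where $f$ is well defined). The curve $t \mapsto \mathbf{x}_0 + t\mathbf{v} = \gamma(t)$ is a parametrisation of
the straight line passing through $\mathbf{x}_0$ with direction $\mathbf{v}$, and $(f \circ \gamma)(t) = f(\mathbf{x}_0 + t\mathbf{v})$.
The directional derivative at $\mathbf{x}_0$ along $\mathbf{v}$ is therefore

$$\frac{\partial f}{\partial \mathbf{v}}(\mathbf{x}_0) = \left(\frac{d}{dt}\, f \circ \gamma\right)(0).$$

Let $\mathbf{e}_i$ be the unit vector whose $i$th component is 1 and all others zero (so
$\mathbf{e}_1 = \mathbf{i}$, $\mathbf{e}_2 = \mathbf{j}$, $\mathbf{e}_3 = \mathbf{k}$). Taking $\mathbf{v} = \mathbf{e}_i$ gives the partial derivative at $\mathbf{x}_0$ with
respect to $x_i$

$$\frac{\partial f}{\partial \mathbf{e}_i}(\mathbf{x}_0) = \frac{\partial f}{\partial x_i}(\mathbf{x}_0), \qquad i = 1, \ldots, d.$$

For example, let $d = 2$ and $i = 1$: from

$$f(\mathbf{x}_0 + t\mathbf{e}_1) = f\big((x_0, y_0) + t(1, 0)\big) = f(x_0 + t, y_0)$$

we obtain, substituting $x = x_0 + t$,

$$\frac{\partial f}{\partial \mathbf{e}_1}(\mathbf{x}_0) = \lim_{t \to 0} \frac{f(x_0 + t, y_0) - f(x_0, y_0)}{t} = \lim_{x \to x_0} \frac{f(x, y_0) - f(x_0, y_0)}{x - x_0} = \frac{\partial f}{\partial x}(x_0, y_0).$$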




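It can be proved that if $f$ admits partial derivatives with respect to every
variable $x_i$ in a whole neighbourhood of $\mathbf{x}_0$, and if such maps are continuous in this
neighbourhood, then $f$ admits at $\mathbf{x}_0$ derivatives along any vector $\mathbf{v} \neq \mathbf{0}$;
these directional derivatives can be written using the gradient as follows

$$\frac{\partial f}{\partial \mathbf{v}}(\mathbf{x}_0) = \mathbf{v} \cdot \nabla f(\mathbf{x}_0) = v_1 \frac{\partial f}{\partial x_1}(\mathbf{x}_0) + \cdots + v_d \frac{\partial f}{\partial x_d}(\mathbf{x}_0).$$

From this formula we also deduce the useful relations

$$\frac{\partial f}{\partial x_i}(\mathbf{x}_0) = \mathbf{e}_i \cdot \nabla f(\mathbf{x}_0), \qquad i = 1, \ldots, d.$$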
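The gradient formula for directional derivatives can be compared with a difference quotient. The sketch below uses $f(x,y,z) = y\log(2x-3z)$ at $(2,3,1)$ from Examples 8.14; the direction `v` is an arbitrary choice made for illustration, and a symmetric quotient is assumed in place of the one-sided quotient of the definition (both have the same limit when the derivative exists).

```python
import math


def f(x, y, z):
    return y * math.log(2 * x - 3 * z)


def grad_f(x, y, z):
    """Gradient of f as computed in Examples 8.15 ii)."""
    u = 2 * x - 3 * z
    return (2 * y / u, math.log(u), -3 * y / u)


def directional_quotient(func, point, v, t=1e-6):
    """Symmetric difference quotient of func along the line point + t*v."""
    forward = [p + t * vi for p, vi in zip(point, v)]
    backward = [p - t * vi for p, vi in zip(point, v)]
    return (func(*forward) - func(*backward)) / (2 * t)


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


x0 = (2.0, 3.0, 1.0)
v = (1.0, 2.0, -1.0)  # hypothetical direction, any non-zero vector would do

numeric = directional_quotient(f, x0, v)
by_gradient = dot(v, grad_f(*x0))
print(numeric, by_gradient)
```

With this choice the scalar product is $1\cdot 6 + 2\cdot 0 + (-1)(-9) = 15$, and the quotient should agree with it up to discretisation error.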


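Under the same assumptions on $f$, if $\gamma : I \to \mathbb{R}^d$ is any curve differentiable
at $t_0 \in I$ such that $\gamma(t_0) = \mathbf{x}_0$, the composite map $(f \circ \gamma)(t) = f(\gamma(t))$ remains
differentiable at $t_0$ and

$$\left(\frac{d}{dt}\, f \circ \gamma\right)(t_0) = \gamma'(t_0) \cdot \nabla f(\mathbf{x}_0); \qquad (8.42)$$

this should be understood as a generalisation of the chain rule seen for one real
variable.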


Example 8.16 


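Consider the distance function $f(x, y) = \sqrt{x^2 + y^2}$ and let $\gamma : (0, +\infty) \to \mathbb{R}^2$ be
the spiral $\gamma(t) = (t\cos t, t\sin t)$. Since

$$f(\gamma(t)) = \sqrt{t^2\cos^2 t + t^2\sin^2 t} = t,$$

we see directly that $\frac{d}{dt} f(\gamma(t)) = 1$ for any $t > 0$. Let us double-check the same
result using (8.42). Define $\mathbf{x} = \gamma(t)$ and the unit vector $\hat{\mathbf{x}} = \frac{\mathbf{x}}{\|\mathbf{x}\|} = (\cos t, \sin t)$.
Then $\gamma'(t) = (\cos t, \sin t) + t(-\sin t, \cos t) = \hat{\mathbf{x}} + t\hat{\mathbf{x}}^\perp$; the notation for the unit
vector $\hat{\mathbf{x}}^\perp = (-\sin t, \cos t)$ is due to $\hat{\mathbf{x}}^\perp \cdot \hat{\mathbf{x}} = 0$. We already know though
(Example 8.15) that $\nabla f(\mathbf{x}) = \hat{\mathbf{x}}$ for any $\mathbf{x} \neq \mathbf{0}$. Therefore

$$\gamma'(t) \cdot \nabla f(\mathbf{x}) = (\hat{\mathbf{x}} + t\hat{\mathbf{x}}^\perp) \cdot \hat{\mathbf{x}} = \hat{\mathbf{x}} \cdot \hat{\mathbf{x}} + t\,\hat{\mathbf{x}}^\perp \cdot \hat{\mathbf{x}} = \|\hat{\mathbf{x}}\|^2 = 1,$$

as expected.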
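Example 8.16 also lends itself to a quick numerical check of the chain rule (8.42). The sketch below is illustrative only: the function names and the sample values of $t$ are assumptions, and the derivative of the composite map is approximated by a symmetric difference quotient.

```python
import math


def gamma(t):
    """The spiral of Example 8.16."""
    return (t * math.cos(t), t * math.sin(t))


def gamma_prime(t):
    """Componentwise derivative of the spiral."""
    return (math.cos(t) - t * math.sin(t), math.sin(t) + t * math.cos(t))


def grad_distance(x, y):
    """Gradient of the distance function away from the origin."""
    n = math.hypot(x, y)
    return (x / n, y / n)


def chain_rule_value(t):
    """Right-hand side of (8.42): gamma'(t) . grad f(gamma(t))."""
    gp = gamma_prime(t)
    gr = grad_distance(*gamma(t))
    return gp[0] * gr[0] + gp[1] * gr[1]


def composite_derivative(t, h=1e-6):
    """Left-hand side of (8.42), approximated by a difference quotient."""
    return (math.hypot(*gamma(t + h)) - math.hypot(*gamma(t - h))) / (2 * h)


for t in (0.5, 2.0, 7.3):
    print(t, chain_rule_value(t), composite_derivative(t))
```

Both columns should be close to 1 for every positive $t$, in agreement with the computation in the example.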


8.6 Exercises 


1. Determine the polar coordinates of the following points in the plane:

$A = (5\sqrt6, 5\sqrt2)$, $B = (5\sqrt6, -5\sqrt2)$,
$C = (-5\sqrt6, 5\sqrt2)$, $D = (-5\sqrt6, -5\sqrt2)$.


2. Write the following points of the plane in polar coordinates: 
a) A=(-5,0) b) B= (0,4) c) C=(0,-3) 


3. Determine the polar coordinates of the following points (without computing 
explicitly the angle): 


a) $A = (2\sqrt3 - 3\sqrt2, 1)$    b) $B = (3\sqrt2 - 2\sqrt3, 3\sqrt2 + 2\sqrt3)$


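4. Determine the polar coordinates of the following points in the plane (leaving
the argument written in terms of a trigonometric function):

$A = (\cos\frac{\pi}{9}, \sin\frac{\pi}{9})$, $B = (-\cos\frac{\pi}{9}, \sin\frac{\pi}{9})$, $C = (\sin\frac{\pi}{9}, \cos\frac{\pi}{9})$.


5. Change to polar coordinates:

a) $A = \left(\frac{\sqrt2}{2}\cos\frac{\pi}{9} - \frac{\sqrt2}{2}\sin\frac{\pi}{9},\ \frac{\sqrt2}{2}\cos\frac{\pi}{9} + \frac{\sqrt2}{2}\sin\frac{\pi}{9}\right)$

b) $B = \left(2\cos\frac{28}{9}\pi,\ 2\sin\frac{28}{9}\pi\right)$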


6. Given $\mathbf{v}_1 = (1, 0, -2)$ and $\mathbf{v}_2 = (0, 1, 1)$, find a real number $\lambda$ so that $\mathbf{v}_1 + \lambda\mathbf{v}_2$
is orthogonal to $\mathbf{v}_3 = (-1, 1, 1)$.


7. Describe the set of planar vectors orthogonal to $\mathbf{v} = (2, -5)$.


8. Determine the set of vectors in space orthogonal to $\mathbf{v}_1 = (1, 0, 2)$ and $\mathbf{v}_2 =
(2, -1, 3)$ simultaneously.


Find the norm of the vectors: 


v1; = (0, V3,7), ve =(1,5,-2), v3= (cos =,sin = cos =, —sin F sin 5). 


10. Determine the cosine of the angle formed by the following pairs: 


a) v=(0,1,0), w= (0,%,2) b): = 0,2.—1), s= (1,11) 


11. Determine the unit vector $\mathbf{u}$ corresponding to $\mathbf{w} = (5, -3, -\sqrt2)$. Then find
the component of $\mathbf{v} = (2, -1, 2\sqrt2)$ along $\mathbf{u}$ and the orthogonal one.


12. Write the following complex numbers in Cartesian form: 

a) (2—3i)(-2+4) b) (3+4)(3 —4) (§ + qe) 
14+2i 2-i 5 
eras (2-1 




13. Determine the trigonometric and exponential forms of: 


a) 2=1 b) z=-1 

Gg) 2=1% d) z=i(1+2) 
er 

e) z= ac f) z=sina+icosa 


L=4 


14. Compute the modulus of: 


1 21 a 
— = 1 —_ 

a ae ea Ty 

32 —1 ; 
Prove that = laf |g) = 1: 
3+ 1z 
16. Solve the following equations: 
a) $z^2 - 2z + 2 = 0$    b) $z^2 + 3iz + 1 = 0$

c) $z|z| - 2z + i = 0$

d) $|z|^2 z^2 = i$


e) 27+4+iz=1 e = 2/7 




17. Verify $1 + i$ is a root of the polynomial $z^4 - 5z^3 + 10z^2 - 10z + 4$ and then
determine all remaining roots.


9. 229 when: 


a) a= [b)] z= 


18. Compute z?, z 


1 


a 
V3-i i 


19. Write explicitly the following numbers in one of the known forms and draw 


their position in the plane: 


a) 2= Wi z= 71 
20. Determine the domains of the functions: 

f(z,y) = — 

b) $f(x, y) = \sqrt{1 - 3xy}$

c) $f(x, y) = \sqrt{3x + y + 1} - \dfrac{1}{\sqrt{2y - x}}$

d) $f(x, y, z) = \log(x^2 + y^2 + z^2 - 9)$


Cc) z=V2-2% 




21. Calculate all partial derivatives of: 


a) $f(x, y) = \sqrt{3x + y^2}$ at $(x_0, y_0) = (1, 2)$

b) $f(x, y, z) = y\,e^{x + yz}$ at $(x_0, y_0, z_0) = (0, 1, -1)$


22. Find the gradient for: 


a) F(e.y) = arctan 4 b) f(x,y) = (w+ y) log(2x — y) 
c) $f(x, y, z) = \sin(x + y)\cos(y - z)$    d) $f(x, y, z) = (x + y)^z$


23. Compute the directional derivatives of the following maps along the vector v 
and evaluate them at the point indicated: 


a) f(x,y) =a2V/y-3 v = (-1,6) gy = (2212) 
b) f(a, Y; z) = se io (12, —9, —4) to = (1, 1, =1) 


8.6.1 Solutions 


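1. All four points have modulus $r = \sqrt{25 \cdot 6 + 25 \cdot 2} = 5\sqrt8$. Formula (8.2) yields,
for $A$,

$$\theta_A = \arctan\frac{5\sqrt2}{5\sqrt6} = \arctan\frac{1}{\sqrt3} = \frac{\pi}{6}$$

since $x > 0$. Similarly for $B$

$$\theta_B = \arctan\left(-\frac{1}{\sqrt3}\right) = -\arctan\frac{1}{\sqrt3} = -\frac{\pi}{6}.$$

For the point $C$, since $x < 0$ and $y > 0$,

$$\theta_C = \arctan\left(-\frac{1}{\sqrt3}\right) + \pi = -\frac{\pi}{6} + \pi = \frac{5}{6}\pi,$$

while $x < 0$, $y < 0$ for $D$, so

$$\theta_D = \arctan\frac{1}{\sqrt3} - \pi = \frac{\pi}{6} - \pi = -\frac{5}{6}\pi.$$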


2. Polar coordinates in the plane: 


a) $r = 5$, $\theta = \pi$;    b) $r = 4$, $\theta = \frac{\pi}{2}$;    c) $r = 3$, $\theta = -\frac{\pi}{2}$.




3. Polar coordinates: 


a) The modulus is $r = \sqrt{31 - 12\sqrt6}$. From $2\sqrt3 < 3\sqrt2$ we have

$$\theta = \arctan\frac{1}{2\sqrt3 - 3\sqrt2} + \pi = \arctan\frac{2\sqrt3 + 3\sqrt2}{-6} + \pi = -\arctan\left(\frac{\sqrt3}{3} + \frac{\sqrt2}{2}\right) + \pi.$$


b) r=5V6, 6 =arctan(5 + 26). 
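4. All points have unit modulus $r = 1$. For $A$

$$\theta_A = \arctan\tan\frac{\pi}{9} = \frac{\pi}{9}.$$

For $B$, $x < 0$ and $y > 0$, so

$$\theta_B = \arctan\left(-\tan\frac{\pi}{9}\right) + \pi = -\frac{\pi}{9} + \pi = \frac{8}{9}\pi.$$

As for $C$,

$$\theta_C = \arctan\frac{\cos\frac{\pi}{9}}{\sin\frac{\pi}{9}};$$

by (2.17), and since the tangent function has period $\pi$, it follows

$$\frac{\cos\frac{\pi}{9}}{\sin\frac{\pi}{9}} = -\frac{\sin(\frac{\pi}{2} + \frac{\pi}{9})}{\cos(\frac{\pi}{2} + \frac{\pi}{9})} = -\tan\frac{11}{18}\pi = -\tan\left(-\frac{7}{18}\pi\right) = \tan\frac{7}{18}\pi,$$

hence $\theta_C = \frac{7}{18}\pi$.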


5. Polar coordinates: 


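a) Just note $\frac{\sqrt2}{2} = \sin\frac{\pi}{4} = \cos\frac{\pi}{4}$ and apply the addition formulas for sine/cosine:

$$A = \left(\cos\left(\frac{\pi}{4} + \frac{\pi}{9}\right), \sin\left(\frac{\pi}{4} + \frac{\pi}{9}\right)\right) = \left(\cos\frac{13}{36}\pi, \sin\frac{13}{36}\pi\right).$$

Because $\frac{13}{36}\pi < \frac{\pi}{2}$, we immediately have $r = 1$ and $\theta = \frac{13}{36}\pi$.

b) $r = 2$, $\theta = -\frac{8}{9}\pi$.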

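6. The vectors $\mathbf{v}_1 + \lambda\mathbf{v}_2$ and $\mathbf{v}_3$ are orthogonal if $(\mathbf{v}_1 + \lambda\mathbf{v}_2) \cdot \mathbf{v}_3 = 0$. But

$$(\mathbf{v}_1 + \lambda\mathbf{v}_2) \cdot \mathbf{v}_3 = \mathbf{v}_1 \cdot \mathbf{v}_3 + \lambda\,\mathbf{v}_2 \cdot \mathbf{v}_3 = -3 + 2\lambda,$$

whence $\lambda = \frac32$ follows.


7. A vector $(x, y)$ is orthogonal to $\mathbf{v}$ if $(x, y) \cdot (2, -5) = 2x - 5y = 0$. The required
set is then made by the vectors lying on the straight line $2x - 5y = 0$. One way to
describe this set is $\{\lambda(5, 2) : \lambda \in \mathbb{R}\}$.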




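8. Imposing $\mathbf{w} = (x, y, z)$ orthogonal to $\mathbf{v}_1$ and $\mathbf{v}_2$ yields $\mathbf{w} \cdot \mathbf{v}_1 = x + 2z = 0$ plus
$\mathbf{w} \cdot \mathbf{v}_2 = 2x - y + 3z = 0$, hence $x = -2z$ and $y = -z$. Put $z = \lambda$, and the set
becomes $\{\lambda(-2, -1, 1) : \lambda \in \mathbb{R}\}$.


9. $\|\mathbf{v}_1\| = \sqrt{52}$, $\|\mathbf{v}_2\| = \sqrt{30}$, $\|\mathbf{v}_3\| = 1$.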


10. Angles between vectors: 


a) cosd=4; b) cos? = 1. 


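11. From $\|\mathbf{w}\| = 6$ it follows $\mathbf{u} = \left(\frac56, -\frac12, -\frac{\sqrt2}{6}\right)$. Since $\mathbf{v} \cdot \mathbf{u} = \frac32$,

$$\mathbf{v}_u = \frac32\,\mathbf{u} = \left(\frac54, -\frac34, -\frac{\sqrt2}{4}\right), \qquad \mathbf{v}_{u^\perp} = (2, -1, 2\sqrt2) - \left(\frac54, -\frac34, -\frac{\sqrt2}{4}\right) = \left(\frac34, -\frac14, \frac94\sqrt2\right).$$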
12. Cartesian form of complex numbers: 
a) —14 8; b) 2+3; c) -2; d) $i. 
13. Exponential and trigonometric form: 
TT “a TT ; ee ) 
a) z= cos > tisin> =e'? ; b) z=cosam+isina7 =e"; 
Tm 3 3 
a 2= V2(cos a +i sin *) =VJ2e't; d)z= V2( cos grt sin 77) = /2e!47 ; 
e) cos~+isin — = e!7 ; f) cos (= — a) +i sin (= - a) = ef F-9) | 
2 2 , 2 2 


14. Modulus of complex numbers: 
a) 8: b) f#- 
15. We proceed indirectly and multiply first the denominator by |z| (= 1) to get 


32-1 
3z +72 


32-1 
3+7z 


16. Solving equations: 


a) $z = 1 \pm i$.
b) The formula for a quadratic equation gives

$$z = \frac{-3i \pm \sqrt{-9 - 4}}{2} = \frac{-3i \pm \sqrt{13}\,i}{2} = \frac{-3 \pm \sqrt{13}}{2}\,i.$$


c) Write $z = x + iy$, so that the equation reads

$$(x + iy)\sqrt{x^2 + y^2} - 2x - 2iy + i = 0,$$

or equivalently,

$$x\sqrt{x^2 + y^2} - 2x + i\left(y\sqrt{x^2 + y^2} - 2y + 1\right) = 0.$$

The real and imaginary parts at the two sides of the equality must be the
same, so

$$x\left(\sqrt{x^2 + y^2} - 2\right) = 0, \qquad y\sqrt{x^2 + y^2} - 2y + 1 = 0.$$

The first equation in the system implies either $x = 0$ or $\sqrt{x^2 + y^2} = 2$. Substituting
2 for the square root in the second equation gives $1 = 0$, which cannot
be. Therefore the only solutions satisfy

$$x = 0, \qquad y|y| - 2y + 1 = 0.$$

Distinguishing the cases $y \ge 0$ and $y < 0$, we have

$$x = 0, \quad y^2 - 2y + 1 = 0 \qquad \text{and} \qquad x = 0, \quad -y^2 - 2y + 1 = 0,$$

so

$$x = 0, \quad y = 1 \qquad \text{and} \qquad x = 0, \quad y = -1 \pm \sqrt2.$$

In conclusion, the solutions are $z = i$, $z = i(-1 - \sqrt2)$ (because $y = -1 + \sqrt2 > 0$
must be discarded).
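The outcome of solution 16 c) can be confirmed by plugging the candidates back into $z|z| - 2z + i = 0$ with complex arithmetic. This is a small sketch under the assumption that floating-point residuals below a tolerance count as zero; the names `lhs` and `candidates` are illustrative.

```python
import math


def lhs(z):
    """Left-hand side of the equation z|z| - 2z + i = 0."""
    return z * abs(z) - 2 * z + 1j


candidates = {
    "z = i": 1j,
    "z = i(-1 - sqrt 2)": 1j * (-1 - math.sqrt(2)),
    "z = i(-1 + sqrt 2), discarded": 1j * (-1 + math.sqrt(2)),
}

for label, z in candidates.items():
    print(label, abs(lhs(z)))
```

The first two residuals should vanish up to rounding, while the discarded value leaves a residual of $(\sqrt2 - 2)^2$, which is why it is not a solution.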


V2 v7 1 “iT. 


z=+--(1+%); e) ee eg re 


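f) Using $|z|^2 = z\bar z$, the new equation is

$$z^3 = z^2\bar z^2 \quad\Longleftrightarrow\quad z^2(z - \bar z^2) = 0.$$

One solution is certainly $z = 0$, and the others satisfy $z - \bar z^2 = 0$. Write
$z = x + iy$, so to obtain

$$x^2 - y^2 - x = 0, \qquad 2xy + y = 0.$$

The bottom relation factorises into $y(2x + 1) = 0$, hence we have two subsystems

$$y = 0, \quad x(x - 1) = 0 \qquad \text{and} \qquad x = -\frac12, \quad y^2 = \frac34.$$

Putting real and imaginary parts back together, the solutions read

$$z = 0; \qquad z = 1; \qquad z = -\frac12 \pm \frac{\sqrt3}{2}\,i.$$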




17. The fact that the polynomial has real coefficients implies the existence of the
complex-conjugate $\bar z = 1 - i$ to $z = 1 + i$ as root. This means $(z - 1 - i)(z - 1 + i) =
z^2 - 2z + 2$ divides the polynomial, indeed

$$z^4 - 5z^3 + 10z^2 - 10z + 4 = (z^2 - 2z + 2)(z^2 - 3z + 2) = (z^2 - 2z + 2)(z - 1)(z - 2).$$

Thus the roots are

$$z = 1 + i, \qquad z = 1 - i, \qquad z = 1, \qquad z = 2.$$
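The four roots found in solution 17 can be verified by direct evaluation. The sketch below evaluates the polynomial with Horner's scheme, a standard technique that is not mentioned in the text; the helper name `horner` is an assumption.

```python
def horner(coeffs, z):
    """Evaluate a polynomial at z; coeffs are listed from the highest degree down."""
    result = 0
    for c in coeffs:
        result = result * z + c
    return result


# z^4 - 5z^3 + 10z^2 - 10z + 4
coeffs = [1, -5, 10, -10, 4]
roots = [1 + 1j, 1 - 1j, 1, 2]
values = [horner(coeffs, r) for r in roots]
print(values)
```

Python's built-in complex type handles the two conjugate roots, so no extra library is needed.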


18. Powers of complex numbers: 

A) 2° = 20, 2° = -16(1+4), i ae 

b) Rationalising the denominators yields 
V3 +i 


= 7 5 


Now write the number in exponential form 


from which 


i ee eee 
= COS — 7sin—- =72 
2 2 ; 


| 
20 — 2074 ari = (1 + V3i) ; 


x 
| 
fo) 


x 
| 

fo) 
for) 
| 

fo) 


19. Computing and drawing complex numbers: 
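a) $z_0 = i$, $z_1 = -\frac12(\sqrt3 + i)$, $z_2 = \frac12(\sqrt3 - i)$.
They are drawn in Fig. 8.21, left.


b) Write the number 1 as $1 = e^{0\pi i}$. Then because $e^{a + 2\pi i} = e^a$, we have

$$z_0 = 1, \quad z_1 = e^{\frac25\pi i}, \quad z_2 = e^{\frac45\pi i}, \quad z_3 = e^{-\frac45\pi i}, \quad z_4 = e^{-\frac25\pi i},$$

see Fig. 8.21, middle.

c) $z_0 = \sqrt[4]{8}\,e^{\frac78\pi i}$, $z_1 = \sqrt[4]{8}\,e^{-\frac18\pi i}$ are represented in Fig. 8.21, right.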


20. Domain of functions: 


a) The domain is $\{(x, y) \in \mathbb{R}^2 : x \neq y^2\}$, the set of all plane points off the
parabola $x = y^2$.


Figure 8.21. From left: cubic roots of $-i$, fifth roots of unity, square roots of $2 - 2i$


b) The map is well defined where the radicand is non-negative, so the domain is

$$\left\{(x, y) \in \mathbb{R}^2 : y \le \frac{1}{3x} \text{ if } x > 0,\ y \ge \frac{1}{3x} \text{ if } x < 0,\ y \in \mathbb{R} \text{ if } x = 0\right\},$$

the set of points lying between the branches of the hyperbola $y = \frac{1}{3x}$.

c) The function is defined only for $3x + y + 1 \ge 0$ and $2y - x > 0$, which makes

$$\{(x, y) \in \mathbb{R}^2 : y \ge -3x - 1\} \cap \left\{(x, y) \in \mathbb{R}^2 : y > \frac{x}{2}\right\}$$

the domain of the function, represented in Fig. 8.22.


d) The map is well defined where the logarithm's argument is positive, hence the
domain is the subset of space

$$\{(x, y, z) \in \mathbb{R}^3 : x^2 + y^2 + z^2 > 9\}.$$

These are the points outside the sphere centred at the origin and with radius
three.


Figure 8.22. The domain of the map $f(x, y) = \sqrt{3x + y + 1} - \frac{1}{\sqrt{2y - x}}$




21. Partial derivatives: 
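a) $\dfrac{\partial f}{\partial x}(1, 2) = \dfrac{3}{2\sqrt7}$, $\dfrac{\partial f}{\partial y}(1, 2) = \dfrac{2}{\sqrt7}$.

b) $\dfrac{\partial f}{\partial x}(0, 1, -1) = e^{-1}$, $\dfrac{\partial f}{\partial y}(0, 1, -1) = 0$, $\dfrac{\partial f}{\partial z}(0, 1, -1) = e^{-1}$.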


22. Gradients: 


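a) $\nabla f(\mathbf{x}) = \left(-\dfrac{y}{x^2 + y^2},\ \dfrac{x}{x^2 + y^2}\right)$.

b) $\nabla f(\mathbf{x}) = \left(\log(2x - y) + \dfrac{2(x + y)}{2x - y},\ \log(2x - y) - \dfrac{x + y}{2x - y}\right)$.

c) $\nabla f(\mathbf{x}) = \big(\cos(x + y)\cos(y - z),\ \cos(x + 2y - z),\ \sin(x + y)\sin(y - z)\big)$.

d) $\nabla f(\mathbf{x}) = \big(z(x + y)^{z-1},\ z(x + y)^{z-1},\ (x + y)^z\log(x + y)\big)$.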


a a 


23. Directional derivatives: 


a) A fey=-1; —) FA (mo) = 


Be 


Integral calculus I 


Integral calculus tackles two rather different issues: 


i) Find all functions that differentiate to a given map over an interval of the real 
line. This operation is essentially an anti-derivation of sorts, and goes by the 
name of indefinite integration. 


ii) Define precisely and compute the area of a region in the plane bounded by 
graphs of maps defined on closed bounded intervals, known as definite integ- 
ration. 


The two problems seem to have little in common, at first sight. The outcome of
indefinite integration is, as we shall soon see, an infinite set of functions. Definite
integration produces instead a number, the surface area of a certain planar region.
A cornerstone result, called with good reason the Fundamental Theorem of integral
calculus, states that the two problems are actually
equivalent: if one can reconstruct a map knowing its derivative, then it is not hard
to find the area of the region bounded by the derivative's graph and the lines
parallel to the coordinate axes, and vice versa.


The beginning of the chapter is devoted to the former problem. Then, we
explain two constructions of definite integrals, due to Cauchy and Riemann; albeit
strongly related, these are presented as separate items for the didactic purpose of
keeping the treatise as versatile as possible. Only in later sections do we discuss the
properties of integrals in a uniform manner. Eventually, we prove the Fundamental
Theorem of integral calculus and show how it is employed to determine areas.


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_9, 
© Springer International Publishing Switzerland 2015 




9.1 Primitive functions and indefinite integrals 


Let f be a function defined on some interval I. 


Definition 9.1 Each function $F$, differentiable on $I$, such that

$$F'(x) = f(x), \qquad \forall x \in I,$$

is called a primitive (function) or an antiderivative of $f$ on $I$.


Not every map defined on a real interval admits primitives: in other words, a given
function is not necessarily the derivative of some map. Finding all maps that
admit primitives on a real interval, which we call integrable on that interval, is
too far-reaching a problem for this book's aims. We limit ourselves to pointing out an
important class of integrable maps, that of continuous maps on a real interval;
the fact that continuity implies integrability will follow from the Fundamental
Theorem of integral calculus.


Examples 9.2 


i) Given the map $f(x) = x$ on $\mathbb{R}$, a primitive function is $F(x) = \frac12 x^2$. The latter
is not the only primitive of $f$: each map $G(x) = \frac12 x^2 + c$, where $c$ is an arbitrary
constant, is a primitive of $f$, because differentiating a constant gives nought.


ii) Consider $f(x) = \frac1x$ over the interval $I = (-\infty, 0)$. The collection of maps
$F(x) = \log|x| + c$ ($c \in \mathbb{R}$) consists of primitives of $f$ on $I$.


The previous examples should explain that if $F(x)$ is a primitive of $f(x)$ on
the interval $I$, then also maps of type $F(x) + c$, with $c$ constant, are primitives.
It becomes therefore natural to ask whether there are other primitives at all. The
answer is no, as shown in the next crucial result.


Proposition 9.3 If $F$ and $G$ are both primitive maps of $f$ on the interval $I$,
there exists a constant $c$ such that

$$G(x) = F(x) + c, \qquad \forall x \in I.$$


Proof. Take the function $H(x) = G(x) - F(x)$ and differentiate it

$$H'(x) = G'(x) - F'(x) = f(x) - f(x) = 0, \qquad \forall x \in I.$$

Thus $H$ has zero derivative at every point of $I$, and as such it must be
constant by Property 6.26. □


Summarising, the following characterisation of the set of primitives of f holds. 




Theorem 9.4 Let $f$ be an integrable map on $I$ and $F$ a primitive. Then any
primitive of $f$ is of the form $F(x) + c$, with the constant $c$ varying in $\mathbb{R}$.


That in turn motivates the following name. 


Definition 9.5 The set of all primitives of $f$ on a real interval is indicated
by

$$\int f(x)\,dx$$

(called indefinite integral of $f$, and spoken 'integral of $f(x)\,dx$').


If $F$ is a primitive then,

$$\int f(x)\,dx = \{F(x) + c : c \in \mathbb{R}\}.$$

It has to be clear that the indefinite integral of $f$ is not a number; it stands rather
for a set of infinitely many maps. It is just quicker to omit the curly brackets and
write

$$\int f(x)\,dx = F(x) + c,$$

which might be sloppy but is certainly effective.


Examples 9.6 
i) The map $f(x) = x^4$ resembles the derivative $5x^4 = D\,x^5$, so a primitive of $f$
is given by $F(x) = \frac15 x^5$ and

$$\int x^4\,dx = \frac15 x^5 + c.$$

ii) Let $f(x) = e^{2x}$. Recalling that $D\,e^{2x} = 2e^{2x}$, the map $F(x) = \frac12 e^{2x}$ is one
primitive, hence

$$\int e^{2x}\,dx = \frac12 e^{2x} + c.$$

iii) As $D\cos 5x = -5\sin 5x$, $f(x) = \sin 5x$ has primitive $F(x) = -\frac15\cos 5x$, and

$$\int \sin 5x\,dx = -\frac15\cos 5x + c.$$

iv) Let

$$f(x) = \sin|x| = \begin{cases} -\sin x & \text{if } x < 0, \\ \sin x & \text{if } x \ge 0. \end{cases}$$

We adopt the following strategy to determine all primitive maps of $f(x)$ on $\mathbb{R}$.
We split the real line in two intervals $I_1 = (-\infty, 0)$, $I_2 = (0, +\infty)$ and discuss
the cases separately. On $I_1$, primitives of $f(x)$ are of the form

$$F_1(x) = \cos x + c_1 \qquad \text{with } c_1 \in \mathbb{R} \text{ arbitrary};$$

similarly, on $I_2$, a primitive will look like

$$F_2(x) = -\cos x + c_2 \qquad c_2 \in \mathbb{R} \text{ arbitrary}.$$

The general primitive $F(x)$ on $\mathbb{R}$ will be written as

$$F(x) = \begin{cases} F_1(x) & \text{if } x < 0, \\ F_2(x) & \text{if } x > 0. \end{cases}$$

Moreover $F$ will have to be continuous at $x = 0$, because a primitive is by mere
definition differentiable, hence continuous a fortiori, at every point in $\mathbb{R}$. We
should thus make sure that the two primitives agree, by imposing

$$\lim_{x \to 0^-} F(x) = \lim_{x \to 0^+} F(x).$$

As $F_1$ and $F_2$ are continuous at $x = 0$, the condition reads $F_1(0) = F_2(0)$, that
is

$$1 + c_1 = -1 + c_2.$$

The relation between $c_1, c_2$ allows one to determine one constant in terms of the
other (coherently with the fact that each primitive depends on one, and only
one, arbitrary real number). For example, putting $c_1 = c$ gives $c_2 = 2 + c$. The
expression for the general primitive of $f(x)$ on $\mathbb{R}$ is then

$$F(x) = \begin{cases} \cos x + c & \text{if } x < 0, \\ -\cos x + 2 + c & \text{if } x \ge 0. \end{cases}$$

Theorem 9.4 states that the graph of a primitive of an integrable map is the 
vertical translate of any other (see Fig. 9.1). 

How to select a particular map among all primitives of a given $f$ then? One
way is to assign a value $y_0$ at a given point $x_0$ on $I$. The knowledge of any one
primitive $F(x)$ determines the primitive $G(x) = F(x) + c_0$ of $f(x)$ whose value at
$x_0$ is precisely $y_0$. In fact,

$$G(x_0) = F(x_0) + c_0 = y_0$$

yields $c_0 = y_0 - F(x_0)$ and so

$$G(x) = F(x) - F(x_0) + y_0.$$


Figure 9.1. Primitives of a given map differ by an additive constant


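The table of derivatives of the main elementary maps can be at this point read
backwards, as a list of primitives. For instance,

a) $\displaystyle\int x^\alpha\,dx = \frac{x^{\alpha+1}}{\alpha+1} + c$    $(\alpha \neq -1)$

b) $\displaystyle\int \frac1x\,dx = \log|x| + c$    (for $x > 0$ or $x < 0$)

c) $\displaystyle\int \sin x\,dx = -\cos x + c$

d) $\displaystyle\int \cos x\,dx = \sin x + c$    (9.1)

e) $\displaystyle\int e^x\,dx = e^x + c$

f) $\displaystyle\int \frac{1}{1+x^2}\,dx = \arctan x + c$

g) $\displaystyle\int \frac{1}{\sqrt{1-x^2}}\,dx = \arcsin x + c$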


Examples 9.7 

i) Determine the primitive of $f(x) = \cos x$ with value 5 at $x_0 = \frac{\pi}{2}$. The map
$F(x) = \sin x$ is one primitive. We are then searching for a $G(x) = \sin x + c_0$.
Imposing $G(\frac{\pi}{2}) = 5$ we see $c_0 = 4$, and the required primitive is

$$G(x) = \sin x + 4.$$

ii) Find the value at $x_1 = 3$ of the primitive of $f(x) = 6x^2 + 5x$ that vanishes at
the point $x_0 = 1$. One primitive map of $f(x)$ is

$$F(x) = 2x^3 + \frac52 x^2.$$

If $G(x) = F(x) + c_0$ has to satisfy $G(1) = 0$, then $c_0 = -\frac92$, whence

$$G(x) = 2x^3 + \frac52 x^2 - \frac92.$$

The image of $x_1 = 3$ is $G(3) = 72$.




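iii) Consider the piecewise-defined map

$$f(x) = \begin{cases} x & \text{if } x \le 1, \\ (x-2)^2 & \text{if } x \ge 1. \end{cases}$$

Mimicking Example 9.6 iv) we obtain

$$F(x) = \begin{cases} \frac12 x^2 + c_1 & \text{if } x < 1, \\ \frac13 (x-2)^3 + c_2 & \text{if } x > 1. \end{cases}$$

Continuity at $x = 1$ forces $\frac12 + c_1 = -\frac13 + c_2$. From this relation, writing $c_1 = c$
gives

$$F(x) = \begin{cases} \frac12 x^2 + c & \text{if } x < 1, \\ \frac13 (x-2)^3 + \frac56 + c & \text{if } x \ge 1. \end{cases}$$

Let us find the primitive of $f(x)$ with zero $x_0 = 3$. Since $x_0 > 1$, the second
expression of $F(x)$

$$F(3) = \frac13 (3-2)^3 + \frac56 + c = 0$$

tells $c = -\frac76$. It follows

$$F(x) = \begin{cases} \frac12 x^2 - \frac76 & \text{if } x < 1, \\ \frac13 (x-2)^3 - \frac13 & \text{if } x \ge 1. \end{cases}$$

Beware that it would have been wrong to make $\frac12 x^2 + c$ vanish at $x_0 = 3$, for
this expression is a primitive only when $x < 1$ and not on the entire line.

Determining the primitive of $f(x)$ that is zero at $x_0 = 1$ does not depend on the
choice of expression for $F(x)$, because of continuity. The solution is

$$F(x) = \begin{cases} \frac12 x^2 - \frac12 & \text{if } x < 1, \\ \frac13 (x-2)^3 + \frac13 & \text{if } x \ge 1. \end{cases}$$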
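The selection rule $G(x) = F(x) - F(x_0) + y_0$ used in Examples 9.7 i) and ii) is short enough to express as a helper. The following is a sketch only; `primitive_through` is a hypothetical name, and the primitives passed to it are the ones quoted in the examples.

```python
import math


def primitive_through(F, x0, y0):
    """Return the primitive G = F + c0 whose value at x0 is y0."""
    shift = y0 - F(x0)  # this is the constant c0 = y0 - F(x0)
    return lambda x: F(x) + shift


# Example 9.7 i): primitive of cos x with value 5 at pi/2.
G1 = primitive_through(math.sin, math.pi / 2, 5.0)

# Example 9.7 ii): primitive of 6x^2 + 5x vanishing at 1, evaluated at 3.
G2 = primitive_through(lambda x: 2 * x**3 + 2.5 * x**2, 1.0, 0.0)

print(G1(0.0), G2(3.0))
```

For a piecewise primitive such as the one in example iii), the helper should be fed the already glued map, since the constant must be fixed on the branch that contains $x_0$.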


9.2 Rules of indefinite integration 


The integrals of the elementary functions are important for the determination of 
other indefinite integrals. The rules below provide basic tools for handling integrals. 


Theorem 9.8 (Linearity of the integral) Suppose $f(x)$, $g(x)$ are integrable
functions on the interval $I$. For any $\alpha, \beta \in \mathbb{R}$ the map $\alpha f(x) + \beta g(x)$ is
still integrable on $I$, and

$$\int \big(\alpha f(x) + \beta g(x)\big)\,dx = \alpha\int f(x)\,dx + \beta\int g(x)\,dx. \qquad (9.2)$$




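Proof. Suppose $F(x)$ is a primitive of $f(x)$ and $G(x)$ a primitive of $g(x)$. By
linearity of the derivative

$$\big(\alpha F(x) + \beta G(x)\big)' = \alpha F'(x) + \beta G'(x) = \alpha f(x) + \beta g(x), \qquad \forall x \in I.$$

This means $\alpha F(x) + \beta G(x)$ is a primitive of $\alpha f(x) + \beta g(x)$ on $I$, which is
the same as (9.2). □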


The above property says that one can integrate a sum one summand at a time, 
and pull multiplicative constants out of the integral sign. 


Examples 9.9 
i) Integrate the polynomial $4x^2 + 3x - 5$. By (9.1) a)

$$\int (4x^2 + 3x - 5)\,dx = 4\int x^2\,dx + 3\int x\,dx - 5\int dx$$
$$= 4\left(\frac13 x^3 + c_1\right) + 3\left(\frac12 x^2 + c_2\right) - 5(x + c_3)$$
$$= \frac43 x^3 + \frac32 x^2 - 5x + c.$$

The numbers $c_1, c_2, c_3$ arising from the single integrals have been 'gathered' into
one arbitrary constant $c$.


ii) Integrate $f(x) = \cos^2 x$. From

$$\cos^2 x = \frac12(1 + \cos 2x)$$

and $D\sin 2x = 2\cos 2x$, it follows

$$\int \cos^2 x\,dx = \frac12\int dx + \frac12\int \cos 2x\,dx = \frac12 x + \frac14\sin 2x + c.$$

Similarly

$$\int \sin^2 x\,dx = \frac12 x - \frac14\sin 2x + c.$$
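Linearity together with (9.1) a) reduces the integration of a polynomial to a termwise rule on its coefficients, as in Example 9.9 i). The sketch below encodes that rule with exact fractions; the function name and the coefficient ordering (lowest degree first) are choices made here for illustration.

```python
from fractions import Fraction


def integrate_polynomial(coeffs):
    """Termwise primitive of a polynomial.

    coeffs[k] is the coefficient of x**k. The returned list has the same
    convention and a zero constant term (the arbitrary constant c is omitted).
    """
    return [Fraction(0)] + [Fraction(c, k + 1) for k, c in enumerate(coeffs)]


# 4x^2 + 3x - 5, written from the constant term upwards.
primitive = integrate_polynomial([-5, 3, 4])
print(primitive)
```

Reading the output from the constant term upwards gives $-5x + \frac32 x^2 + \frac43 x^3$, the primitive found in the example up to the additive constant.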


Theorem 9.10 (Integration by parts) Let $f(x)$, $g(x)$ be differentiable
over $I$. If the map $f'(x)g(x)$ is integrable on $I$, then so is $f(x)g'(x)$, and

$$\int f(x)g'(x)\,dx = f(x)g(x) - \int f'(x)g(x)\,dx. \qquad (9.3)$$




Proof. Let $H(x)$ be any primitive of $f'(x)g(x)$ on $I$. By formula (6.4)

$$[f(x)g(x) - H(x)]' = \big(f(x)g(x)\big)' - H'(x)$$
$$= f'(x)g(x) + f(x)g'(x) - f'(x)g(x)$$
$$= f(x)g'(x).$$

Therefore the map $f(x)g(x) - H(x)$ is a primitive of $f(x)g'(x)$, exactly
what (9.3) claims. □


In practice, one integrates a product of functions by identifying first one factor
with $f(x)$ and the other with $g'(x)$; then one determines a primitive $g(x)$ of $g'(x)$
and, at last, one finds the primitive of $f'(x)g(x)$ and uses (9.3).

Examples 9.11 
i) Compute

$$\int x e^x\,dx.$$

Call $f(x) = x$ and $g'(x) = e^x$. Then $f'(x) = 1$, and we conveniently choose $e^x$ as
primitive of itself. Formula (9.3) yields

$$\int x e^x\,dx = x e^x - \int e^x\,dx = x e^x - (e^x + c) = (x - 1)e^x + c.$$

Since the constant of integration is completely arbitrary, in the last step the sign
of $c$ was flipped with no harm done.

Had we chosen $f(x) = e^x$ and $g'(x) = x$ ($f'(x) = e^x$ and $g(x) = \frac12 x^2$), we would
have ended up with

$$\int x e^x\,dx = \frac12 x^2 e^x - \frac12\int x^2 e^x\,dx,$$

which is not particularly helpful (rather the opposite).


ii) Determine

$$\int \log x\,dx.$$

Let us put $f(x) = \log x$ and $g'(x) = 1$, so that $f'(x) = \frac1x$, $g(x) = x$. Thus

$$\int \log x\,dx = x\log x - \int \frac1x\,x\,dx = x\log x - \int dx$$
$$= x\log x - (x + c) = x(\log x - 1) + c,$$

given that $c$ is arbitrary.


iii) Find

$$S = \int e^x \sin x\,dx.$$

We start by defining $f(x) = e^x$ and $g'(x) = \sin x$. Then $f'(x) = e^x$, $g(x) =
-\cos x$, and

$$S = -e^x\cos x + \int e^x\cos x\,dx.$$

Let us integrate by parts once again, by putting $f(x) = e^x$ and $g'(x) = \cos x$
this time. Since $f'(x) = e^x$, $g(x) = \sin x$,

$$S = -e^x\cos x + e^x\sin x - \int e^x\sin x\,dx = e^x(\sin x - \cos x) - S.$$

A primitive $F(x)$ of $e^x\sin x$ may be written as

$$F(x) = e^x(\sin x - \cos x) - G(x),$$

$G(x)$ being another primitive of $e^x\sin x$. By the characterisation of Theorem 9.4
then,

$$2S = e^x(\sin x - \cos x) + c$$

hence

$$S = \frac12 e^x(\sin x - \cos x) + c.$$
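A result obtained by integration by parts is easy to validate: differentiate the candidate primitive and compare with the integrand. The sketch below does this numerically for the three primitives of Examples 9.11; the sample points and tolerance are arbitrary choices, and a symmetric difference quotient stands in for the exact derivative.

```python
import math


def central_difference(func, x, h=1e-6):
    """Symmetric difference quotient approximating func'(x)."""
    return (func(x + h) - func(x - h)) / (2 * h)


# Pairs (candidate primitive F, integrand f) taken from Examples 9.11.
pairs = [
    (lambda x: (x - 1) * math.exp(x), lambda x: x * math.exp(x)),
    (lambda x: x * (math.log(x) - 1), lambda x: math.log(x)),
    (
        lambda x: 0.5 * math.exp(x) * (math.sin(x) - math.cos(x)),
        lambda x: math.exp(x) * math.sin(x),
    ),
]

sample_points = (0.5, 1.0, 2.0)  # positive, so that log x is defined

errors = [
    max(abs(central_difference(F, x) - f(x)) for x in sample_points)
    for F, f in pairs
]
print(errors)
```

Small errors support the computations but do not replace them; a sign slip in one of the by-parts steps would show up here as an error of order one.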


Theorem 9.12 (Integration by substitution) Let $f(y)$ be integrable on
the interval $J$ and $F(y)$ a primitive. Suppose $\varphi(x)$ is a differentiable function
from $I$ to $J$. Then the map $f(\varphi(x))\varphi'(x)$ is integrable on $I$ and

$$\int f(\varphi(x))\varphi'(x)\,dx = F(\varphi(x)) + c, \qquad (9.4)$$

which is usually stated in the less formal yet simpler way

$$\int f(\varphi(x))\varphi'(x)\,dx = \int f(y)\,dy. \qquad (9.5)$$


Proof. Formula (6.7) for differentiating a composite map gives

$$\frac{d}{dx} F(\varphi(x)) = \frac{dF}{dy}(\varphi(x))\,\frac{d\varphi}{dx}(x) = f(\varphi(x))\varphi'(x).$$

Thus $F(\varphi(x))$ integrates $f(\varphi(x))\varphi'(x)$, i.e., (9.4) is proven. □


We insist on the fact that the correct meaning of (9.5) is expressed by (9.4):
the integral on the left is found by integrating $f$ with respect to $y$ and then
substituting for $y$ the function $\varphi(x)$, so that the right-hand side too depends on the
variable $x$. Formula (9.5) is easy to remember with a trick: differentiate $y = \varphi(x)$,
so that $\frac{dy}{dx} = \varphi'(x)$. Viewing the left-hand side as a formal quotient (in Leibniz's
notation), multiply it by $dx$; substituting $dy = \varphi'(x)\,dx$ in one integral yields the
other.


Examples 9.13 


i) Determine

$$\int x e^{x^2}\,dx.$$

Let $y = \varphi(x) = x^2$, so $\varphi'(x) = 2x$. Then

$$\int x e^{x^2}\,dx = \frac12\int e^{x^2}\,2x\,dx = \frac12\int e^y\,dy = \frac12 e^y + c.$$

Going back to $x$,

$$\int x e^{x^2}\,dx = \frac12 e^{x^2} + c.$$


ii) Compute

$$\int \tan x\,dx.$$

First, recall $\tan x = \dfrac{\sin x}{\cos x}$ and $(\cos x)' = -\sin x$. Put $y = \varphi(x) = \cos x$:

$$\int \tan x\,dx = -\int \frac{1}{\cos x}(\cos x)'\,dx = -\int \frac1y\,dy$$
$$= -\log|y| + c = -\log|\cos x| + c.$$


iii) Find

$$\int \frac{1}{\sqrt{1+x^2}}\,dx.$$

By (6.18) it follows directly

$$\int \frac{1}{\sqrt{1+x^2}}\,dx = \sinh^{-1} x + c.$$

Alternatively, we may substitute $y = \varphi(x) = \sqrt{1+x^2} - x$:

$$dy = \left(\frac{x}{\sqrt{1+x^2}} - 1\right)dx = \frac{x - \sqrt{1+x^2}}{\sqrt{1+x^2}}\,dx,$$

hence $\dfrac{1}{\sqrt{1+x^2}}\,dx = -\dfrac1y\,dy$. This gives

$$\int \frac{1}{\sqrt{1+x^2}}\,dx = -\int \frac1y\,dy = -\log|y| + c = -\log\big(\sqrt{1+x^2} - x\big) + c,$$

where the absolute value was removed, as $\sqrt{1+x^2} - x > 0$ for any $x$.
The two expressions are indeed the same, for

$$-\log\big(\sqrt{1+x^2} - x\big) = \log\big(\sqrt{1+x^2} + x\big) = \sinh^{-1} x.$$


iv) The integral

$$\int \frac{1}{\sqrt{x^2-1}}\,dx$$

can be determined by the previous technique. The substitution $y = \varphi(x) =
\sqrt{x^2-1} - x$ gives

$$\int \frac{1}{\sqrt{x^2-1}}\,dx = \log\big|\sqrt{x^2-1} + x\big| + c.$$



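v) The integral

$$S = \int \sqrt{1+x^2}\,dx$$

is found as in example iii). Integrate by parts with $f(x) = \sqrt{1+x^2}$ and $g'(x) = 1$,
so $f'(x) = \dfrac{x}{\sqrt{1+x^2}}$, $g(x) = x$ and

$$S = x\sqrt{1+x^2} - \int \frac{x^2}{\sqrt{1+x^2}}\,dx = x\sqrt{1+x^2} - \int \frac{x^2 + 1 - 1}{\sqrt{1+x^2}}\,dx$$
$$= x\sqrt{1+x^2} - \int \sqrt{1+x^2}\,dx + \int \frac{1}{\sqrt{1+x^2}}\,dx$$
$$= x\sqrt{1+x^2} - S + \int \frac{1}{\sqrt{1+x^2}}\,dx.$$

Therefore

$$2S = x\sqrt{1+x^2} + \int \frac{1}{\sqrt{1+x^2}}\,dx = x\sqrt{1+x^2} + \log\big(\sqrt{1+x^2} + x\big) + c,$$

and eventually

$$S = \frac12 x\sqrt{1+x^2} + \frac12\log\big(\sqrt{1+x^2} + x\big) + c.$$

Similar story for $\displaystyle\int \sqrt{x^2-1}\,dx$.


vi) Determine

$$S = \int \sqrt{1-x^2}\,dx.$$

As above, we may integrate by parts remembering $\displaystyle\int \frac{1}{\sqrt{1-x^2}}\,dx = \arcsin x + c$.
Namely, with $f(x) = \sqrt{1-x^2}$, $g'(x) = 1$, we have $f'(x) = -\dfrac{x}{\sqrt{1-x^2}}$, $g(x) = x$,
whence

$$S = x\sqrt{1-x^2} - \int \frac{-x^2}{\sqrt{1-x^2}}\,dx = x\sqrt{1-x^2} - S + \int \frac{1}{\sqrt{1-x^2}}\,dx.$$

So we have

$$2S = x\sqrt{1-x^2} + \int \frac{1}{\sqrt{1-x^2}}\,dx,$$

i.e.,

$$S = \frac12 x\sqrt{1-x^2} + \frac12\arcsin x + c.$$

Let us do this in a different way. Put $y = \arcsin x$, so $dx = \cos y\,dy$ and $\sqrt{1-x^2} =
\cos y$. These give

$$S = \int \cos^2 y\,dy = \frac12\int (\cos 2y + 1)\,dy = \frac14\sin 2y + \frac12 y + c$$
$$= \frac12\sin y\cos y + \frac12 y + c = \frac12 x\sqrt{1-x^2} + \frac12\arcsin x + c.$$


vii) Finally, let us determine

$$S = \int \frac{1}{e^x + e^{-x}}\,dx.$$

Change $y = e^x$, so $dy = e^x\,dx$, or $dx = \frac1y\,dy$. Then

$$\int \frac{1}{e^x + e^{-x}}\,dx = \int \frac{1}{y + \frac1y}\,\frac1y\,dy = \int \frac{1}{1+y^2}\,dy = \arctan y + c = \arctan e^x + c.$$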
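In example iii) of Examples 9.13 the substitution produced $-\log(\sqrt{1+x^2} - x)$, which the text identifies with $\log(\sqrt{1+x^2} + x) = \sinh^{-1} x$. The identity can be spot-checked against the standard library's `math.asinh`; the grid of sample points below is an arbitrary choice, and the function names are illustrative.

```python
import math


def via_substitution(x):
    """Primitive obtained from the substitution y = sqrt(1 + x^2) - x."""
    return -math.log(math.sqrt(1 + x * x) - x)


def via_conjugate(x):
    """Equivalent closed form log(sqrt(1 + x^2) + x)."""
    return math.log(math.sqrt(1 + x * x) + x)


points = [k / 4 for k in range(-20, 21)]  # x from -5 to 5 in steps of 0.25

max_gap = max(
    max(abs(via_substitution(x) - math.asinh(x)), abs(via_conjugate(x) - math.asinh(x)))
    for x in points
)
print(max_gap)
```

The two logarithms agree because $(\sqrt{1+x^2} - x)(\sqrt{1+x^2} + x) = 1$, so one argument is the reciprocal of the other.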


Example ii) is a special case of the following useful relation

$$\int \frac{\varphi'(x)}{\varphi(x)}\,dx = \log|\varphi(x)| + c \qquad (9.6)$$

that descends from (9.5) by $f(y) = \frac1y$.


Hitherto all instances had one common feature: the maps $f$ were built from a
finite number of elementary functions by algebraic operations and compositions,
and so were the primitives $F$. In such a case, one says that $f$ is integrable by
elementary methods. Unfortunately though, not all functions arising this way
are integrable by elementary methods. Consider $f(x) = e^{-x^2}$, whose relevance in
Probability Theory is paramount. It can be shown that its primitives (which exist, for $f$
is continuous on $\mathbb{R}$) cannot be expressed by elementary functions. The same holds
for $f(x) = \dfrac{\sin x}{x}$.

The problem of finding an explicit primitive for a given function is highly non-trivial.
A large class of maps which are integrable by elementary methods is that
of rational functions.


9.2.1 Integrating rational maps 


Consider maps of the general form

$$f(x) = \frac{P(x)}{Q(x)},$$

where $P(x)$ and $Q(x)$ denote polynomials of degrees $n$, $m$ ($m \ge 1$) respectively.
We want to prove they admit primitives in terms of rational functions, logarithms
and inverse tangent functions.


First of all note that if $n \ge m$, we may divide $P(x)$ by $Q(x)$

$$P(x) = Q(x)D(x) + R(x),$$

with $D(x)$ a polynomial of degree $n - m$ and $R(x)$ of degree $\le m - 1$. Therefore

$$\int \frac{P(x)}{Q(x)}\,dx = \int D(x)\,dx + \int \frac{R(x)}{Q(x)}\,dx.$$

The problem boils down to integrating a rational map $g(x) = \dfrac{R(x)}{Q(x)}$ in which the
numerator's degree is less than the denominator's.
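The division $P(x) = Q(x)D(x) + R(x)$ is ordinary polynomial long division. A minimal sketch follows, assuming coefficient lists ordered from the highest degree down and exact rational arithmetic; the function name `poly_divmod` and the sample polynomials are illustrative and not taken from the text.

```python
from fractions import Fraction


def poly_divmod(p, q):
    """Divide polynomial p by q, returning (quotient D, remainder R).

    Coefficients are listed from the highest degree down, and q[0] must be
    non-zero. If deg p < deg q the quotient is zero and the remainder is p.
    """
    p = [Fraction(c) for c in p]
    q = [Fraction(c) for c in q]
    if len(p) < len(q):
        return [Fraction(0)], p
    quotient = []
    remainder = p[:]
    for _ in range(len(p) - len(q) + 1):
        factor = remainder[0] / q[0]
        quotient.append(factor)
        # Subtract factor * q, aligned with the current leading term.
        for k in range(len(q)):
            remainder[k] -= factor * q[k]
        remainder.pop(0)  # the leading coefficient is now zero
    return quotient, remainder


# (x^3 - 2x^2 + 4) divided by (x^2 + 1)
D, R = poly_divmod([1, -2, 0, 4], [1, 0, 1])
print(D, R)
```

For this sample input the quotient is $x - 2$ and the remainder is $-x + 6$, so the rational map splits into a polynomial plus a fraction whose numerator has lower degree than the denominator.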


We discuss a few simple situations of this type, which will turn out to be 
fundamental for treating the generic integrand. 


1 
i) Let g(x) = ——— with a € R; by (9.1) b) 


(9.8) 


iii) Let g(x) = , with p? —q < 0, so that the denominator has no real 


x2 + 2px+q 
roots and is positive. Putting 


s=V/q-—p? > 0, 


a little algebra shows 


a? + Qpe+q =a? +2pr+p?t+(q—p*) =(c4+p)? +8" =s 


Now substitute y = v(x) = 


/ 1 q = / 1 q 
4 al —————__ Ms 
x2 + 2px +q s2 tae y 


Recalling (9.1) f) we may conclude 


1 1 
/ a san C. (9.9) 
x* + 2px + q S 8 
b 
iv) Consider g(x) = eae with p? — q still negative. Due to the identity 


ax +b = ax + ap +b—ap = 5(2x + 2p) + (b— ap) 


314 9 Integral calculus I 
we write 
b 2 2 1 
[SS e- 5 | ee ae + (bap) f ae 
x2 + 2px + q 2) x2+2pr+4q x? + 2px + q 


Now use (9.6) with v(x) = x? + 2px + q, and (9.9): 


b 
/ wea dna = log(x? + 2px + q) + P arctan — ae +c. (9.10) 


x? + 2px + q 2 
ax +b 


—_—__—_—_., with p? — q < 0 and r > 1. Integ- 
(G4 dee 2a with p* — q and r nteg 


v) More generally, let g(x) = 
rating by parts 


1 
/ (x? + 2px + gq)? - 

and substituting y(x) = 2? + 2px + q, we end up writing the integral of g as sum 
of known terms, plus the integral of a map akin to g, but where the exponent is 
r—1. Thus the integrand to consider simplifies to one whose denominator is raised 
to a smaller power. From r = 1, solved above, we find r = 2, then r = 3 et cetera 
up to the given r, one step at a time. The argument’s details are left to the willing 
reader. 
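A quick numerical sanity check of case iv) is sketched below; it is an illustration added here, not part of the original argument. It differentiates the candidate primitive $\frac{a}{2}\log(x^2+2px+q) + \frac{b-ap}{s}\arctan\frac{x+p}{s}$ by a central difference and compares the result with $g(x)$. The helper names and the step size `h` are assumptions of this sketch.

```python
import math

def g(x, a, b, p, q):
    # Integrand of case iv): (ax + b) / (x^2 + 2px + q).
    return (a * x + b) / (x * x + 2 * p * x + q)

def primitive(x, a, b, p, q):
    # Candidate primitive from case iv); requires p^2 - q < 0.
    s = math.sqrt(q - p * p)
    quad = x * x + 2 * p * x + q
    return a / 2 * math.log(quad) + (b - a * p) / s * math.atan((x + p) / s)

def central_difference(F, x, h=1e-5):
    # Symmetric difference quotient, an O(h^2) approximation of F'(x).
    return (F(x + h) - F(x - h)) / (2 * h)

# Parameters of the third integral in Examples 9.14: (4x - 5)/(x^2 - 2x + 10).
a, b, p, q = 4, -5, -1, 10
for x in (-3.0, 0.0, 1.5, 4.0):
    slope = central_difference(lambda t: primitive(t, a, b, p, q), x)
    print(x, slope, g(x, a, b, p, q))
```

The two printed columns should agree to several digits; agreement at a few sample points does not prove the formula, but it catches sign and coefficient slips quickly.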


Examples 9.14 


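As direct application we compute

$$\int \frac{1}{2x-4}\,dx = \frac12 \log|x-2| + c,$$

$$\int \frac{1}{(3x+5)^2}\,dx = -\frac{1}{3(3x+5)} + c,$$

$$\int \frac{4x-5}{x^2-2x+10}\,dx = 2\int \frac{2x-2}{x^2-2x+10}\,dx - \int \frac{1}{(x-1)^2+9}\,dx = 2\log(x^2-2x+10) - \frac13\arctan\frac{x-1}{3} + c.$$
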
Reducing the integration of the general rational function $g(x) = \dfrac{R(x)}{Q(x)}$ to the previous special cases requires a factorisation of the denominator involving only terms like

$$(x - \alpha)^r \quad\text{or}\quad (x^2 + 2px + q)^s$$

with $p^2 - q < 0$. The existence of such a decomposition descends from a version of the Fundamental Theorem of Algebra.


Theorem 9.15 A polynomial $Q(x)$ of degree $m$ with real coefficients decomposes uniquely as a product

$$Q(x) = d\,(x-\alpha_1)^{r_1}\cdots(x-\alpha_h)^{r_h}\,(x^2+2p_1x+q_1)^{s_1}\cdots(x^2+2p_kx+q_k)^{s_k}, \qquad (9.11)$$

where $d, \alpha_i, p_j, q_j$ are real and $r_i, s_j$ integers such that

$$r_1 + \cdots + r_h + 2s_1 + \cdots + 2s_k = m.$$

The $\alpha_i$, all distinct, are the real roots of $Q$ counted with multiplicity $r_i$. The factors $x^2 + 2p_jx + q_j$ are pairwise distinct and irreducible over $\mathbb{R}$, i.e., $p_j^2 - q_j < 0$, and have two complex(-conjugate) roots $\beta_{j,\pm}$ of multiplicity $s_j$.


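Using the factorisation (9.11) of $Q(x)$ we can now write $g(x)$ as sum of partial fractions

$$\frac{R(x)}{Q(x)} = \frac{1}{d}\left[F_1(x) + \cdots + F_h(x) + \bar F_1(x) + \cdots + \bar F_k(x)\right], \qquad (9.12)$$

where each $F_i(x)$ takes the form

$$F_i(x) = \frac{A_{i1}}{x-\alpha_i} + \frac{A_{i2}}{(x-\alpha_i)^2} + \cdots + \frac{A_{ir_i}}{(x-\alpha_i)^{r_i}},$$

while $\bar F_j(x)$ are like

$$\bar F_j(x) = \frac{B_{j1}x + C_{j1}}{x^2+2p_jx+q_j} + \frac{B_{j2}x + C_{j2}}{(x^2+2p_jx+q_j)^2} + \cdots + \frac{B_{js_j}x + C_{js_j}}{(x^2+2p_jx+q_j)^{s_j}},$$

for suitable constants $A_{i\ell}$, $B_{j\mu}$, $C_{j\mu}$. Note the total number of constants is $r_1 + \cdots + r_h + 2s_1 + \cdots + 2s_k = m$.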

To recover the undetermined coefficients we can transform the right-hand side of (9.12) into one fraction, whose denominator is clearly $Q(x)$. The resulting numerator $\mathcal{R}(x)$ is a polynomial of degree $\leq m - 1$ that must coincide with $R(x)$, and its coefficients are linear combinations of the unknown constants we are after. To find these numbers, the following principle on identity of polynomials is at our disposal.


Theorem 9.16 Two polynomials of degree $m-1$ coincide if and only if either
of the next conditions holds 


a) the coefficients of corresponding monomials coincide; 
b) the polynomials assume the same values at m distinct points. 


The first equivalence is easily derived from Proposition 7.5. 


Going back to the $m$ unknowns $A_{i\ell}$, $B_{j\mu}$, $C_{j\mu}$, we could impose that the coefficients of each monomial in the combined numerator $\mathcal{R}(x)$ and in $R(x)$ be the same, or else choose $m$ values of $x$ where the polynomials must agree. In the latter case the best choice falls on the real zeroes of $Q(x)$; should these be fewer than $m$ in number, we could also take $x = 0$.

Once these coefficients have been determined, we can start integrating the 
right-hand side of (9.12) and rely on the fundamental cases i)—v) above. 
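The second strategy (forcing agreement at $m$ distinct points) amounts to solving a small linear system. The sketch below is illustrative only: it takes the identity $3x^2 + x - 4 = A(x^2+4x+5) + (Bx+C)(x+1)$ that arises in Examples 9.17 iii), evaluates it at three points and solves for $A, B, C$ exactly. The function names and the choice of Gauss-Jordan elimination are assumptions of this sketch, not the book's method.

```python
from fractions import Fraction

def solve_linear(rows, rhs):
    """Solve a square linear system exactly by Gauss-Jordan elimination.

    Assumes the system is non-singular (otherwise next() raises StopIteration).
    """
    n = len(rows)
    aug = [[Fraction(v) for v in row] + [Fraction(b)] for row, b in zip(rows, rhs)]
    for col in range(n):
        # Pick a row with a nonzero pivot and move it into place.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]

def coefficients_example_iii(points=(-1, 0, 1)):
    """Impose 3x^2 + x - 4 = A(x^2+4x+5) + (Bx+C)(x+1) at three distinct points."""
    # Each row holds the multipliers of A, B, C at the chosen x.
    rows = [[x * x + 4 * x + 5, x * (x + 1), x + 1] for x in points]
    rhs = [3 * x * x + x - 4 for x in points]
    return solve_linear(rows, rhs)

print(coefficients_example_iii())
```

Any three distinct points give the same triple $(A, B, C)$, in line with Theorem 9.16 b); choosing the real zero $x = -1$ of $Q$ simply makes one equation involve $A$ alone.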




As usual, the technique is best illustrated with a few examples. 


Examples 9.17 
i) Let us integrate

$$f(x) = \frac{2x^3 + x^2 - 4x + 7}{x^2 + x - 2}.$$

The numerator has greater degree than the denominator, so we divide the polynomials:

$$f(x) = 2x - 1 + \frac{x+5}{x^2+x-2}.$$

The denominator factorises as $Q(x) = (x-1)(x+2)$. Therefore the coefficients to be found, $A_1 = A_{11}$ and $A_2 = A_{21}$, should satisfy

$$\frac{x+5}{x^2+x-2} = \frac{A_1}{x-1} + \frac{A_2}{x+2},$$

that is to say

$$x + 5 = A_1(x+2) + A_2(x-1), \qquad (9.13)$$

hence

$$x + 5 = (A_1 + A_2)x + (2A_1 - A_2).$$

Comparing coefficients yields the linear system

$$\begin{cases} A_1 + A_2 = 1, \\ 2A_1 - A_2 = 5, \end{cases}$$

solved by $A_1 = 2$, $A_2 = -1$. Another possibility is to compute (9.13) at the zeroes $x = 1$, $x = -2$ of $Q(x)$, obtaining $6 = 3A_1$ and $3 = -3A_2$, whence again $A_1 = 2$, $A_2 = -1$. Therefore,

$$\int f(x)\,dx = \int (2x-1)\,dx + 2\int \frac{1}{x-1}\,dx - \int \frac{1}{x+2}\,dx = x^2 - x + 2\log|x-1| - \log|x+2| + c.$$

ii) Determine a primitive of the function

$$f(x) = \frac{x^2 - 3x + 3}{x^3 - 2x^2 + x}.$$

The denominator splits as $Q(x) = x(x-1)^2$, so we must search for $A_1 = A_{11}$, $A_{21}$ and $A_{22}$ such that

$$\frac{x^2 - 3x + 3}{x^3 - 2x^2 + x} = \frac{A_1}{x} + \frac{A_{21}}{x-1} + \frac{A_{22}}{(x-1)^2},$$

or

$$x^2 - 3x + 3 = A_1(x-1)^2 + A_{21}\,x(x-1) + A_{22}\,x.$$

Putting $x = 0$ yields $A_1 = 3$, with $x = 1$ we find $A_{22} = 1$. The remaining $A_{21}$ is determined by picking a third value $x \neq 0, 1$. For instance $x = -1$ gives $7 = 12 + 2A_{21} - 1$, so $A_{21} = -2$.

In conclusion,

$$\int f(x)\,dx = 3\int \frac{1}{x}\,dx - 2\int \frac{1}{x-1}\,dx + \int \frac{1}{(x-1)^2}\,dx = 3\log|x| - 2\log|x-1| - \frac{1}{x-1} + c.$$


iii) Integrate

$$f(x) = \frac{3x^2 + x - 4}{x^3 + 5x^2 + 9x + 5}.$$

The point $x = -1$ annihilates the denominator (the sum of the odd-degree coefficients equals that of the even-degree ones), so the denominator splits as $Q(x) = (x+1)(x^2 + 4x + 5)$ by Ruffini's rule. The unknown coefficients are $A = A_{11}$, $B = B_{11}$, $C = C_{11}$, so that

$$\frac{3x^2 + x - 4}{x^3 + 5x^2 + 9x + 5} = \frac{A}{x+1} + \frac{Bx + C}{x^2 + 4x + 5},$$

hence

$$3x^2 + x - 4 = A(x^2 + 4x + 5) + (Bx + C)(x+1).$$

Choosing $x = -1$, and then $x = 0$, produces $A = -1$ and $C = 1$. The last coefficient $B = 4$ is found by taking $x = 1$. Thus

$$\int f(x)\,dx = -\int \frac{1}{x+1}\,dx + \int \frac{4x+1}{x^2+4x+5}\,dx$$

$$= -\int \frac{1}{x+1}\,dx + 2\int \frac{2x+4}{x^2+4x+5}\,dx - 7\int \frac{1}{1 + (x+2)^2}\,dx$$

$$= -\log|x+1| + 2\log(x^2+4x+5) - 7\arctan(x+2) + c.$$


Note that many functions $f(x)$ that are not rational in the variable $x$ can be transformed, by an appropriate change $t = \varphi(x)$, into a rational map in the new variable $t$. Special cases thereof include:

i) $f$ is a rational function of $\sqrt[p]{x-a}$ for some integer $p$ and $a$ real. Then one lets

$$t = \sqrt[p]{x-a}, \quad\text{whence}\quad x = a + t^p \quad\text{and}\quad dx = p\,t^{p-1}\,dt.$$

ii) $f$ is rational in $e^{ax}$ for some real $a \neq 0$. The substitution

$$t = e^{ax}$$

gives $x = \dfrac{1}{a}\log t$ and $dx = \dfrac{1}{at}\,dt$.

iii) $f$ is rational in $\sin x$ and/or $\cos x$. In this case

$$t = \tan\frac{x}{2},$$

together with the identities

$$\sin x = \frac{2t}{1+t^2}, \qquad \cos x = \frac{1-t^2}{1+t^2}, \qquad (9.14)$$

does the job, because then $x = 2\arctan t$, hence

$$dx = \frac{2}{1+t^2}\,dt. \qquad (9.15)$$

iv) If $f$ is rational in $\sin^2 x$, $\cos^2 x$, $\tan x$, it is more convenient to set $t = \tan x$ and use

$$\sin^2 x = \frac{t^2}{1+t^2}, \qquad \cos^2 x = \frac{1}{1+t^2}; \qquad (9.16)$$

from $x = \arctan t$, it follows

$$dx = \frac{1}{1+t^2}\,dt. \qquad (9.17)$$


In the concluding examples we only indicate how to arrive at a rational expres- 
sion in t, leaving it to the reader to integrate and return to the original variable x. 
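A change of variable of this kind can be checked numerically before any primitive is computed. The sketch below is an added illustration under stated assumptions: it takes the integrand $\sin x/(1+\sin x)$, for which $t = \tan\frac{x}{2}$ with (9.14) and (9.15) gives $\frac{2t}{(1+t)^2}\cdot\frac{2}{1+t^2} = \frac{4t}{(1+t)^2(1+t^2)}$, and compares the two definite integrals over matching intervals. The composite Simpson rule and all function names are choices made for this sketch.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def integral_in_x(upper):
    # Integral of sin x / (1 + sin x) over [0, upper], in the original variable.
    return simpson(lambda x: math.sin(x) / (1 + math.sin(x)), 0.0, upper)

def integral_in_t(upper):
    # Same quantity after t = tan(x/2): the integrand is rational in t.
    return simpson(lambda t: 4 * t / ((1 + t) ** 2 * (1 + t * t)),
                   0.0, math.tan(upper / 2))

print(integral_in_x(1.0), integral_in_t(1.0))
```

The two numbers should coincide up to quadrature error; the upper limit must stay below $\pi$ so that $\tan\frac{x}{2}$ remains finite.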


Examples 9.18 
i) Consider

$$S = \int \frac{x}{1 + \sqrt{x-1}}\,dx.$$

We let $t = \sqrt{x-1}$, so $x = 1 + t^2$ and $dx = 2t\,dt$. The substitution gives

$$S = 2\int \frac{(1+t^2)\,t}{1+t}\,dt.$$


ii) The integral

$$S = \int \frac{e^{-x}}{e^{2x} - 2e^x + 2}\,dx$$

becomes, by $t = e^x$, $dx = \frac{1}{t}\,dt$,

$$S = \int \frac{1}{t^2(t^2 - 2t + 2)}\,dt.$$

iii) Reduce the integrand in

$$S = \int \frac{\sin x}{1 + \sin x}\,dx$$

to a rational map. Referring to (9.14) and (9.15),

$$S = 4\int \frac{t}{(1+t)^2(1+t^2)}\,dt.$$

iv) At last, consider

$$S = \int \frac{1}{1 + \sin^2 x}\,dx.$$

Here we use (9.16) and (9.17):

$$S = \int \frac{1}{1 + 2t^2}\,dt.$$


Figure 9.2. Trapezoidal region of f over [a, 5] 


9.3 Definite integrals 


Let us consider a bounded map $f$ defined on a bounded and closed interval $I = [a,b] \subset \mathbb{R}$. One suggestively calls trapezoidal region of $f$ over the interval $[a,b]$, denoted by $T(f;a,b)$, the part of plane enclosed within the interval $[a,b]$, the vertical lines passing through the end-points $a$, $b$ and the graph of $f$ (see Fig. 9.2):

$$T(f;a,b) = \{(x,y) \in \mathbb{R}^2 : a \leq x \leq b,\ 0 \leq y \leq f(x) \ \text{or}\ f(x) \leq y \leq 0\}$$

(which constraint on $y$ clearly depending on the sign of $f(x)$).

Under suitable assumptions on f one can associate to the trapezoidal region of 
f over [a,b] a number, the ‘definite integral of f over [a, b]’. In case f is positive, 
this number is indeed the area of the region. In particular, when the region is 
particularly simple (a rectangle, a triangle, a trapezium and the like), the definite 
integral returns one of the classical formulas of elementary geometry. 

The many notions of definite integral depend on what is demanded of the 
integrand. We shall present two types. The first one, normally linked to the name 
of Cauchy, deals with continuous or piecewise-continuous maps on $[a,b]$.


Definition 9.19 A map $f : [a,b] \to \mathbb{R}$ is piecewise-continuous when it is continuous everywhere except at a finite number of points, at which the discontinuity is either removable or a jump.


The second construction goes back to Riemann, and leads to a wider class of integrable functions¹.


¹ A further type, known as Lebesgue integral, defines yet another set of integrable functions, which turns out to be the most natural in modern applications. This theory though goes beyond the purposes of the present textbook.


9.4 The Cauchy integral 


To start with, we assume $f$ continuous on $[a,b]$, and generalise slightly at a successive step. The idea is to construct a sequence that approximates the trapezoidal region of $f$, and then take a limit-of-sorts. Let us see how.

Take $n$ any positive integer. Divide $[a,b]$ in $n$ equal parts of length $\Delta x = \frac{b-a}{n}$ and denote by $x_k = a + k\Delta x$, $k = 0, 1, \ldots, n$, the subdivision points; note that they are ordered increasingly by the index, as $a = x_0 < x_1 < \ldots < x_{n-1} < x_n = b$. For $k = 1, \ldots, n$, we denote by $I_k$ the interval $[x_{k-1}, x_k]$. The map $f$ is continuous on $[a,b]$, hence by restriction on each $I_k$; Weierstrass's Theorem 4.31 implies $f$ assumes minimum and maximum on $I_k$, say

$$m_k = \min_{x \in I_k} f(x), \qquad M_k = \max_{x \in I_k} f(x).$$

Define now the quantities

$$s_n = \sum_{k=1}^{n} m_k\,\Delta x, \qquad S_n = \sum_{k=1}^{n} M_k\,\Delta x,$$

called respectively lower sum and upper sum of $f$ for the above partition of $[a,b]$. By definition $m_k \leq M_k$ and $\Delta x > 0$, so $s_n \leq S_n$.

When $f$ is positive on $[a,b]$, the meaning is immediate (Fig. 9.3): $m_k \Delta x$ represents the area of the rectangle $r_k = I_k \times [0, m_k]$, contained in the trapezoidal region of $f$ over $I_k$. Thus, $s_n$ is the total area of the rectangles $r_k$ and approximates from below the area of $T(f;a,b)$. For the same reasons, $S_n$ is the area of the union of rectangles $R_k = I_k \times [0, M_k]$, and it approximates the area of $T(f;a,b)$ from above.


Using properties of continuous maps defined on closed and bounded intervals, 
we can prove the following result (see Appendix A.5.1, p. 461). 


Theorem 9.20 The sequences $\{s_n\}$ and $\{S_n\}$ are convergent, and their limits coincide.


Figure 9.3. Lower sum (left) and upper sum (right) of f on [a, b]


Based on this fact, we can introduce the definite integral. 


Definition 9.21 One calls definite integral of $f$ over $[a,b]$ the number

$$\int_a^b f(x)\,dx = \lim_{n\to\infty} s_n = \lim_{n\to\infty} S_n$$

(which we read integral from $a$ to $b$ of $f(x)\,dx$ or just integral from $a$ to $b$ of $f$).


Examples 9.22 


i) Take a constant $f$ on $[a,b]$. If $c$ is its value, then $m_k = M_k = c$ for any $k$, so

$$s_n = S_n = c\sum_{k=1}^{n} \Delta x = c(b-a)$$

whichever $n$. Therefore $\displaystyle\int_a^b f(x)\,dx = c(b-a)$.

ii) Consider $f(x) = x$ over $[0,1]$. The region $T(f;0,1)$ is the isosceles right triangle of vertices $A = (0,0)$, $B = (1,0)$, $C = (1,1)$ that has area $\frac12$. We want to check the definite integral of $f$ over $[0,1]$ gives the same result. Fix $n > 1$. Then $\Delta x = \frac1n$ and, for $k = 0, \ldots, n$, $x_k = \frac{k}{n}$. Since $f$ is increasing, $m_k = x_{k-1}$ and $M_k = x_k$, so

$$s_n = \sum_{k=1}^{n} x_{k-1}\,\Delta x = \frac{1}{n^2}\sum_{k=1}^{n}(k-1), \qquad S_n = \sum_{k=1}^{n} x_k\,\Delta x = \frac{1}{n^2}\sum_{k=1}^{n} k.$$

Now $\sum_{k=1}^{n} k$ is the sum of the first $n$ natural numbers, hence $\frac{n(n+1)}{2}$ by (3.2). For analogous reasons $\sum_{k=1}^{n}(k-1)$ is the sum of natural numbers from 0 (or 1) to $n-1$, and equals $\frac{(n-1)n}{2}$, whence

$$s_n = \frac{n(n-1)}{2n^2}, \qquad S_n = \frac{n(n+1)}{2n^2}.$$

Taking the limit for $n \to \infty$ of these sequences, we find $\frac12$ for both.


This example shows that even for a function as harmless as f(x) = x, computing 
the definite integral using the definition is rather demanding. Obviously one would 
hope to have more efficient tools to calculate integrals of continuous maps. For that 
we shall have to wait until Sect. 9.8. 
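To make the construction concrete, the lower and upper sums can be computed exactly for small $n$. The sketch below is an illustration added here and works only under a simplifying assumption: $f$ is increasing on $[a,b]$, so that the minimum and maximum on each $[x_{k-1}, x_k]$ sit at the end-points. The function name and the use of exact fractions are choices of this sketch.

```python
from fractions import Fraction

def lower_upper_sums(f, a, b, n):
    """Lower and upper sums s_n, S_n for an INCREASING f on [a, b].

    For increasing f the minimum on [x_{k-1}, x_k] is f(x_{k-1})
    and the maximum is f(x_k), so no search is needed.
    """
    a, b = Fraction(a), Fraction(b)
    dx = (b - a) / n
    points = [a + k * dx for k in range(n + 1)]
    s_n = sum(f(points[k - 1]) for k in range(1, n + 1)) * dx
    S_n = sum(f(points[k]) for k in range(1, n + 1)) * dx
    return s_n, S_n

for n in (10, 100, 1000):
    print(n, lower_upper_sums(lambda x: x, 0, 1, n))
```

For $f(x) = x$ on $[0,1]$ the output reproduces $s_n = \frac{n(n-1)}{2n^2}$ and $S_n = \frac{n(n+1)}{2n^2}$, both squeezing towards $\frac12$ as $n$ grows.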




We discuss now the extension of the notion of definite integral. If $f$ is continuous on $[a,b]$ and $x^*$ denotes an interior point of the interval, it is possible to prove

$$\int_a^b f(x)\,dx = \int_a^{x^*} f(x)\,dx + \int_{x^*}^b f(x)\,dx.$$

This formula's meaning is evident, and it suggests how to define integrals of piecewise-continuous maps. Let $a = x_0 < x_1 < \ldots < x_{m-1} < x_m = b$ be the points where $f$ is not continuous, lying between $a$ and $b$ (assuming the latter might be discontinuity points too). Let $f_i$ be the restriction of $f$ to the interior of $[x_{i-1}, x_i]$ that extends $f$ continuously at the boundary

$$f_i(x) = \begin{cases} \displaystyle\lim_{x \to x_{i-1}^+} f(x), & \text{for } x = x_{i-1}, \\ f(x), & \text{for } x_{i-1} < x < x_i, \\ \displaystyle\lim_{x \to x_i^-} f(x), & \text{for } x = x_i. \end{cases}$$

We define

$$\int_a^b f(x)\,dx = \sum_{i=1}^{m} \int_{x_{i-1}}^{x_i} f_i(x)\,dx.$$

If $f$ is genuinely continuous on $[a,b]$, the above definition coincides with Definition 9.21, because $m = 1$ and the map $f_1$ is $f$.

Moreover, it follows immediately that modifying a (piecewise-)continuous map 
at a finite number of points will not alter its definite integral. 

The study of Cauchy’s integral will be resumed with Sect. 9.6. 


9.5 The Riemann integral 


Throughout the section f will indicate a bounded map on [a,b]. Let us start from 
integrating some elementary functions (called step functions), and slowly proceed 
to more general maps, whose integral builds upon the former type by means of 
upper and lower bounds. 

Choose $n + 1$ points of $[a,b]$ (not necessarily uniformly spread)

$$a = x_0 < x_1 < \ldots < x_{n-1} < x_n = b.$$

They induce a partition of $[a,b]$ into sub-intervals $I_k = [x_{k-1}, x_k]$, $k = 1, \ldots, n$. Dividing further one of the $I_k$ we obtain a so-called finer partition, also known as refinement of the initial partition. Step functions are constant on each subinterval of a partition of $[a,b]$, see Fig. 9.4. More precisely,


Figure 9.4. Graph of a step function on [a, b]


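Definition 9.23 A map $f : [a,b] \to \mathbb{R}$ is a step function if there exist a partition of $[a,b]$ by $\{x_0, x_1, \ldots, x_n\}$ together with constants $c_1, c_2, \ldots, c_n \in \mathbb{R}$ such that

$$f(x) = c_k, \qquad \forall x \in (x_{k-1}, x_k), \quad k = 1, \ldots, n.$$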


We say that the partition is adapted to $f$ if $f$ is constant on each interval $(x_{k-1}, x_k)$. Refinements of adapted partitions are still adapted. In particular if $f$ and $g$ are step functions on $[a,b]$, it is always possible to manufacture a partition that is adapted to both maps just by taking the union of the points of two partitions adapted to $f$ and $g$, respectively.

From now on $S([a,b])$ will denote the set of step functions on $[a,b]$.


Definition 9.24 Let $f \in S([a,b])$ and $\{x_0, x_1, \ldots, x_n\}$ be an adapted partition. Call $c_k$ the constant value of $f$ on $(x_{k-1}, x_k)$. Then the number

$$\int_I f = \sum_{k=1}^{n} c_k (x_k - x_{k-1})$$

is called definite integral of $f$ on $I = [a,b]$.


A few remarks are necessary. 


i) The definition is independent of the chosen partition. In particular, if $f(x) = c$ is constant on $[a,b]$, $\int_I f = c(b-a)$.


ii) Redefining f at a finite number of places leaves the integral unchanged; in 
particular, the definite integral does not depend upon the values of f at points 
of discontinuity. 


In case $f$ is positive on $I$, the number $\int_I f$ represents precisely the area of the trapezoidal region of $f$ over $I$: the latter is in fact the sum of rectangles with base $x_k - x_{k-1}$ and height $c_k$ (Fig. 9.5).
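The definition translates directly into code. The sketch below is illustrative: a step function is represented by its adapted partition and the list of constants, and the helper `refine` (a name introduced here) inserts one extra point, which by remark i) must leave the integral unchanged.

```python
from fractions import Fraction

def step_integral(points, values):
    """Integral of a step function: sum of c_k * (x_k - x_{k-1}).

    points = [x_0, ..., x_n] is an adapted partition,
    values = [c_1, ..., c_n] are the constants on the open subintervals.
    """
    if len(points) != len(values) + 1:
        raise ValueError("need one more partition point than values")
    return sum(Fraction(c) * (Fraction(points[k + 1]) - Fraction(points[k]))
               for k, c in enumerate(values))

def refine(points, values, new_point):
    """Insert new_point into the partition; the split interval keeps its constant."""
    for k in range(len(values)):
        if points[k] < new_point < points[k + 1]:
            return (points[:k + 1] + [new_point] + points[k + 1:],
                    values[:k] + [values[k], values[k]] + values[k + 1:])
    raise ValueError("new_point must lie strictly inside a subinterval")

points, values = [0, 1, 3, 4], [2, -1, 5]
finer_points, finer_values = refine(points, values, 2)
print(step_integral(points, values), step_integral(finer_points, finer_values))
```

Both calls return the same number, which is the computational content of the independence from the chosen adapted partition.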

The next result will play an important role. 


Figure 9.5. Region under a positive step function on the interval [a, b]


Property 9.25 If $g, h \in S([a,b])$ are such that $g(x) \leq h(x)$, $\forall x \in [a,b]$, then

$$\int_I g \leq \int_I h.$$

Proof. Let $\{x_0, x_1, \ldots, x_n\}$ define a partition adapted to both maps (this exists by what was said earlier). Call $c_k$ and $d_k$ the values assumed on $(x_{k-1}, x_k)$ by $g$ and $h$, respectively. By hypothesis $c_k \leq d_k$, $k = 1, \ldots, n$, so

$$\int_I g = \sum_{k=1}^{n} c_k(x_k - x_{k-1}) \leq \sum_{k=1}^{n} d_k(x_k - x_{k-1}) = \int_I h. \qquad \Box$$


Now let $f : [a,b] \to \mathbb{R}$ be a generic bounded map, and put

$$s_f = \sup_{x \in [a,b]} f(x) \in \mathbb{R} \quad\text{and}\quad i_f = \inf_{x \in [a,b]} f(x) \in \mathbb{R}.$$

We introduce the sets of step functions bounding $f$ from above or from below, namely

$$S_f^+ = \{h \in S([a,b]) : f(x) \leq h(x),\ \forall x \in [a,b]\}$$

contains all step functions bigger than $f$, while

$$S_f^- = \{g \in S([a,b]) : g(x) \leq f(x),\ \forall x \in [a,b]\}$$

contains all those smaller than $f$. These are not empty, for they contain at least the constant maps

$$h(x) = s_f \quad\text{and}\quad g(x) = i_f.$$


It then makes sense to look at the sets of definite integrals. 


Definition 9.26 The number

$$\overline{\int_I} f = \inf\left\{\int_I h : h \in S_f^+\right\}$$

is called the upper integral of $f$ on $I = [a,b]$, and

$$\underline{\int_I} f = \sup\left\{\int_I g : g \in S_f^-\right\}$$

the lower integral of $f$ on $I = [a,b]$.


As $S_f^+ \neq \emptyset$, clearly $\overline{\int_I} f < +\infty$, and similarly $\underline{\int_I} f > -\infty$. The fact that such quantities are finite relies on the following.


Property 9.27 Each bounded map $f$ defined on $[a,b]$ satisfies

$$\underline{\int_I} f \leq \overline{\int_I} f.$$

Proof. If $g \in S_f^-$ and $h \in S_f^+$, by definition

$$g(x) \leq f(x) \leq h(x), \qquad \forall x \in [a,b],$$

so Property 9.25 implies

$$\int_I g \leq \int_I h.$$

Keeping $g$ fixed and varying $h$ we have

$$\int_I g \leq \overline{\int_I} f.$$

Now varying $g$ in this inequality proves the claim. □


At this stage one could ask whether equality in Property 9.27 holds by any chance for all bounded maps. The answer is no, as the next example shows.


Example 9.28 


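The Dirichlet function is defined as follows

$$f(x) = \begin{cases} 1 & \text{if } x \in \mathbb{Q}, \\ 0 & \text{if } x \in \mathbb{R} \setminus \mathbb{Q}. \end{cases}$$

Each interval $(x_{k-1}, x_k)$ of a partition of $[0,1]$ contains rational and non-rational points. Step functions in $S_f^+$ are all greater than or equal to one, whereas the maps in $S_f^-$ will be non-positive (except at a finite number of places). In conclusion

$$\overline{\int_I} f = 1 \quad\text{and}\quad \underline{\int_I} f = 0.$$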


Our observation motivates introducing the next term. 


Definition 9.29 A bounded map $f$ on $I = [a,b]$ is said integrable (precisely: Riemann integrable) on $I$ if

$$\underline{\int_I} f = \overline{\int_I} f.$$

The common value is called definite integral of $f$ on $[a,b]$, and denoted with $\int_I f$ or $\int_a^b f(x)\,dx$.


When $f$ is a positive map on $[a,b]$ the geometric meaning of the definite integral is quite clear: $T(f;a,b)$ is a subset of $T(h;a,b)$ for any function $h \in S_f^+$, and contains $T(g;a,b)$ relative to any $g \in S_f^-$. The upper integral gives thus an estimate from above (i.e., larger) of the area of the trapezoidal region of $f$ over $I$, and similarly, the lower integral represents an approximation from below. Essentially, $f$ is integrable when these two coincide, hence when the integral 'is' the area of the trapezoidal region of $f$.

Step functions $f$ are evidently integrable: denoting by $\int_I f$ the quantity of Definition 9.24, the fact that $f \in S_f^-$ implies $\int_I f \leq \underline{\int_I} f$, and $\overline{\int_I} f \leq \int_I f$ is consequence of $f \in S_f^+$. Therefore

$$\int_I f \leq \underline{\int_I} f \leq \overline{\int_I} f \leq \int_I f,$$

and the upper integral must be equal to the lower.


Beyond step functions, the world of integrable maps is vast. 


Example 9.30 


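Consider $f(x) = x$ on $[0,1]$. We verify by Riemann integration that the trapezoidal region of $f$ measures indeed $\frac12$. Divide $[0,1]$ into $n > 1$ equal parts, a partition corresponding to the points $\{0, \frac1n, \frac2n, \ldots, \frac{n-1}{n}, 1\} = \{\frac{k}{n} : k = 0, \ldots, n\}$. Now take the step functions

$$h_n(x) = \begin{cases} \dfrac{k}{n} & \text{if } \dfrac{k-1}{n} < x \leq \dfrac{k}{n},\ k = 1, \ldots, n, \\ 0 & \text{if } x = 0, \end{cases}$$

and

$$g_n(x) = \begin{cases} \dfrac{k-1}{n} & \text{if } \dfrac{k-1}{n} < x \leq \dfrac{k}{n},\ k = 1, \ldots, n, \\ 0 & \text{if } x = 0. \end{cases}$$

Since $g_n(x) \leq f(x) \leq h_n(x)$, $\forall x \in [0,1]$, it follows $h_n \in S_f^+$, $g_n \in S_f^-$. Moreover by (3.2),

$$\int_I h_n = \sum_{k=1}^{n} \frac{k}{n}\left(\frac{k}{n} - \frac{k-1}{n}\right) = \frac{1}{n^2}\sum_{k=1}^{n} k = \frac{1}{n^2}\,\frac{n(n+1)}{2} = \frac12 + \frac{1}{2n},$$

and similarly

$$\int_I g_n = \frac12 - \frac{1}{2n}.$$

These imply

$$\overline{\int_I} f \leq \inf_n \int_I h_n = \frac12 \quad\text{and}\quad \underline{\int_I} f \geq \sup_n \int_I g_n = \frac12,$$

hence

$$\overline{\int_I} f \leq \frac12 \leq \underline{\int_I} f.$$

Recalling Property 9.27 we conclude $\int_I f = \frac12$.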


Studying the integrability of a map by means of the definition is rather non-trivial, 
even when one deals with maps having simple expression. So it would be good on 
the one hand to know in advance a large class of integrable maps, on the other 
to have powerful methods for computing integrals. While the second point will be 
addressed in Sect. 9.8, the result we state next is a relatively broad answer to the 
former problem; its proof may be found in Appendix A.5.2, p. 463. 


Theorem 9.31 Among the class of integrable maps on [a,b] are 


a) continuous maps on $[a,b]$;

b) piecewise-continuous maps on $[a,b]$;

c) continuous maps on $(a,b)$ which are bounded on $[a,b]$;

d) monotone functions on $[a,b]$.


As an application of the theorem, the map

$$f(x) = \begin{cases} 1 + \sin\dfrac{1}{x} & \text{if } 0 < x \leq 1, \\ 0 & \text{if } x = 0, \end{cases}$$

is integrable, for continuous on $(0,1]$ and bounded (by 0 and 2) on $[0,1]$.


Figure 9.6. Integrable maps on [0, 1]


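The same for

$$f(x) = \begin{cases} \dfrac{1}{n} & \text{if } \dfrac{1}{n+1} < x \leq \dfrac{1}{n},\ n = 1, 2, \ldots, \\ 0 & \text{if } x = 0, \end{cases}$$

which is increasing (not strictly) on $[0,1]$, see Fig. 9.6.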
A couple more properties will be useful later; see Appendix A.5.2, p. 466, for 
their proof. 


Proposition 9.32 If f is integrable on [a,b], then 


i) $f$ is integrable on any subinterval $[c,d] \subset [a,b]$;

ii) $|f|$ is integrable on $[a,b]$.


9.6 Properties of definite integrals 


A (piecewise-)continuous map is Cauchy integrable (Theorem 9.20) and at the same time integrable following Riemann (Theorem 9.31). The two types of definite integral always agree for such maps, as seen explicitly for $f(x) = x$ in Examples 9.22 ii), 9.30. We shall not prove this fact rigorously. Anyhow, that reason is good enough to use a unique symbol for both Riemann's and Cauchy's integrals. Henceforth $\mathcal{R}([a,b])$ shall be the set of integrable maps on $[a,b]$.

Recall $\int_a^b f(x)\,dx$ is a number, depending only on $f$ and the interval $[a,b]$; it certainly depends upon no variable. The letter $x$, present in the symbol for historical reasons essentially, is a 'virtual variable', and as such may be substituted by one's own preferred letter; writing $\int_a^b f(x)\,dx$, rather than $\int_a^b f(s)\,ds$ or $\int_a^b f(y)\,dy$ is a matter of taste, for all three symbols represent the same number.


Figure 9.7. The area of the trapezoidal region of f on [a, b] is $\int_a^b |f(x)|\,dx$


If $f \in \mathcal{R}([a,b])$ is positive we have shown the definite integral expresses the area of the trapezoidal region of $f$ over $[a,b]$. For negative $f$ the same holds provided one changes sign to the value. When $f$ has no fixed sign, the integral measures the difference of the positive regions (above the x-axis) and the negative regions (below it), so the area between $f$ and the horizontal axis is also the integral of the map $|f|$:

$$\text{Area of } T(f;a,b) = \int_a^b |f(x)|\,dx.$$

This is due to the symmetrising effect of the absolute value, which reflects the regions lying below the axis in a rigid way (as in Fig. 9.7).

Finally, let us slightly generalise the definite integral. Take $f \in \mathcal{R}([a,b])$. For $a \leq c < d \leq b$, set

$$\int_d^c f(x)\,dx = -\int_c^d f(x)\,dx \quad\text{and}\quad \int_c^c f(x)\,dx = 0. \qquad (9.18)$$

The symbol $\int_c^d f(x)\,dx$ is now defined whichever limits $c$ and $d$ we consider in the integrability domain $[a,b]$.


The following five properties descend immediately from the definition. 


Theorem 9.33 Let $f$ and $g$ be integrable on a bounded interval $I$ of the real line.

i) (Additivity with respect to the domain of integration) For any $a, b, c \in I$,

$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx.$$

ii) (Linearity) For any $a, b \in I$ and $\alpha, \beta \in \mathbb{R}$,

$$\int_a^b \big(\alpha f(x) + \beta g(x)\big)\,dx = \alpha \int_a^b f(x)\,dx + \beta \int_a^b g(x)\,dx.$$

iii) (Positivity) Let $a, b \in I$, with $a < b$. If $f \geq 0$ on $[a,b]$ then

$$\int_a^b f(x)\,dx \geq 0.$$

If $f$ is additionally continuous, equality holds if and only if $f$ is the zero map.

iv) (Monotonicity) Let $a, b \in I$, $a < b$. If $f \leq g$ in $[a,b]$, then

$$\int_a^b f(x)\,dx \leq \int_a^b g(x)\,dx.$$

v) (Upper and lower bounds) Let $a, b \in I$, $a < b$. Then

$$\left|\int_a^b f(x)\,dx\right| \leq \int_a^b |f(x)|\,dx.$$

Proof. See Appendix A.5.2, p. 467.


9.7 Integral mean value 


The definite integral of an integrable map $f$ over the usual real interval $[a,b]$ furnishes a way of approximating the function's behaviour by a constant.


Definition 9.34 By (integral) mean value (sometimes integral average) of $f$ over the interval $[a,b]$ one understands the number

$$m(f;a,b) = \frac{1}{b-a}\int_a^b f(x)\,dx.$$

The geometric meaning is clear when $f$ is positive on $[a,b]$, for an equivalent version of the mean value reads

$$\int_a^b f(x)\,dx = (b-a)\,m(f;a,b).$$

In this case the area of the trapezoidal region $T(f;a,b)$ equals the area of the rectangle with base $[a,b]$ and having the integral average as height (Fig. 9.8).


Figure 9.8. Integral average of f over [a, b]

The next statement formalises the relation between the integral mean value of 
a function and its range. 


Theorem 9.35 (Mean Value Theorem) Let $f$ be integrable over $[a,b]$. The integral mean of $f$ over $[a,b]$ satisfies

$$\inf_{x \in [a,b]} f(x) \leq m(f;a,b) \leq \sup_{x \in [a,b]} f(x). \qquad (9.19)$$

If moreover $f$ is continuous on $[a,b]$, there is at least one $z \in [a,b]$ such that

$$m(f;a,b) = f(z). \qquad (9.20)$$

Proof. Call $i_f = \inf_{x \in [a,b]} f(x)$ and $s_f = \sup_{x \in [a,b]} f(x)$, so for any $x \in [a,b]$

$$i_f \leq f(x) \leq s_f.$$

By property iv) of Theorem 9.33

$$(b-a)\,i_f = \int_a^b i_f\,dx \leq \int_a^b f(x)\,dx \leq \int_a^b s_f\,dx = (b-a)\,s_f,$$

where we have used the expression for the integral of a constant. Now divide by $b - a$ to attain (9.19).

Supposing $f$ continuous, Weierstrass's Theorem 4.31 yields

$$i_f = \min_{x \in [a,b]} f(x) \quad\text{and}\quad s_f = \max_{x \in [a,b]} f(x),$$

hence (9.19) tells that $m(f;a,b)$ lies between the maximum and minimum of $f$ on $[a,b]$. The existence of a point $z$ for which (9.20) holds then follows from (4.16). □


Figure 9.9. The Mean Value Theorem of integral calculus


Example 9.36 
The integral mean of the continuous map

$$f(x) = \begin{cases} 2x & \text{if } 0 \leq x \leq 1, \\ 2 & \text{if } 1 < x \leq 2, \end{cases}$$

over $[0,2]$ is

$$m(f;0,2) = \frac12\int_0^2 f(x)\,dx = \frac12\left(\int_0^1 2x\,dx + \int_1^2 2\,dx\right) = \frac12(1 + 2) = \frac32.$$

In conformity with the statement, the mean value is indeed a value the function takes, in fact $m(f;0,2) = f(\frac34)$ (Fig. 9.9, left).

Consider now the piecewise-continuous map

$$f(x) = \begin{cases} 2x & \text{if } 0 \leq x \leq 1, \\ 5 & \text{if } 1 < x \leq 2. \end{cases}$$

The mean value over $[0, 5/4]$ is $m(f;0,5/4) = f(9/10)$ and belongs to the map's range; this is not so when we consider $[0,2]$, because $m(f;0,2) = 3$ (Fig. 9.9, right). This example shows that the continuity of $f$ is just a sufficient condition for (9.20) to hold. □
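The mean values in this example are easy to reproduce numerically. The sketch below is an added illustration: it uses a midpoint rule (exact, up to rounding, for piecewise-linear maps whose break point falls on the grid) and encodes the two maps as `f_continuous` and `f_jump`, names introduced here. The value 5 taken by the second map on $(1,2]$ is an assumption inferred from the two mean values quoted in the example, $f(9/10)$ on $[0,5/4]$ and $3$ on $[0,2]$.

```python
def integral_mean(f, a, b, n=2000):
    """Approximate m(f; a, b) = (1/(b-a)) * integral of f, by the midpoint rule."""
    h = (b - a) / n
    total = sum(f(a + (k + 0.5) * h) for k in range(n))
    return total * h / (b - a)

def f_continuous(x):
    # First map: 2x on [0, 1], then constantly 2 on (1, 2].
    return 2 * x if x <= 1 else 2.0

def f_jump(x):
    # Second map: 2x on [0, 1], then constantly 5 on (1, 2] (inferred value).
    return 2 * x if x <= 1 else 5.0

print(integral_mean(f_continuous, 0, 2))   # mean attained at x = 3/4
print(integral_mean(f_jump, 0, 1.25))      # equals f_jump(0.9)
print(integral_mean(f_jump, 0, 2))         # not a value taken by f_jump
```

The range of `f_jump` is $[0,2] \cup \{5\}$, so a mean equal to 3 is not attained, in agreement with the remark that continuity cannot simply be dropped from (9.20).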


A closing remark for the sequel. Taking (9.18) into account, we observe that the mean value formula stays valid if the limits of integration are interchanged, hence the theorem is correct also when $a > b$:

$$m(f;a,b) = \frac{1}{b-a}\int_a^b f(x)\,dx = \frac{1}{a-b}\int_b^a f(x)\,dx = m(f;b,a). \qquad (9.21)$$


9.8 The Fundamental Theorem of integral calculus 


Let $f$ be defined on the real interval $I$, which we do not assume bounded necessarily, but suppose $f$ is integrable on every closed and bounded subinterval of $I$. This is the case if $f$ is continuous. We call integral function of $f$ on $I$ any map of the form

$$F(x) = F_{x_0}(x) = \int_{x_0}^{x} f(s)\,ds, \qquad (9.22)$$

where $x_0 \in I$ is a fixed point and $x$ varies in $I$. An integral function is thus obtained by integrating $f$ on an interval in which one end-point is fixed while the other is variable. By (9.18) any integral function is defined on the whole $I$, and $F_{x_0}$ has a zero at $x_0$.

The Fundamental Theorem of integral calculus establishes the basic inverse 
character of the operations of differentiation and integration, namely that any 
integral function of a given continuous map f over J is a primitive of f on that 
interval. 


Theorem 9.37 (fundamental of integral calculus) Let $f$ be defined and continuous over a real interval $I$. Given $x_0 \in I$, let

$$F(x) = \int_{x_0}^{x} f(s)\,ds$$

denote an integral function of $f$ on $I$. Then $F$ is differentiable everywhere over $I$ and

$$F'(x) = f(x), \qquad \forall x \in I.$$


Proof. Let us start by fixing an $x$ inside $I$ and calling $\Delta x$ an increment (positive or negative) such that $x + \Delta x$ belongs to $I$. Consider the difference quotient of $F$

$$\frac{F(x + \Delta x) - F(x)}{\Delta x} = \frac{1}{\Delta x}\left(\int_{x_0}^{x+\Delta x} f(s)\,ds - \int_{x_0}^{x} f(s)\,ds\right).$$

By property i) in Theorem 9.33,

$$\int_{x_0}^{x+\Delta x} f(s)\,ds = \int_{x_0}^{x} f(s)\,ds + \int_{x}^{x+\Delta x} f(s)\,ds,$$

so

$$\frac{F(x + \Delta x) - F(x)}{\Delta x} = \frac{1}{\Delta x}\int_{x}^{x+\Delta x} f(s)\,ds = m(f; x, x + \Delta x).$$

Figure 9.10. The Fundamental Theorem of integral calculus

Thus, the difference quotient of the integral function $F$ between $x$ and $x + \Delta x$ equals the mean value of $f$ between $x$ and $x + \Delta x$. Since $f$ is continuous, the Mean Value Theorem 9.35 guarantees the existence in that interval of a $z = z(\Delta x)$ for which $m(f; x, x + \Delta x) = f(z(\Delta x))$, in other words

$$\frac{F(x + \Delta x) - F(x)}{\Delta x} = f(z(\Delta x)). \qquad (9.23)$$

Take the limit for $\Delta x \to 0$. For simplicity we can assume $\Delta x > 0$. From

$$x \leq z(\Delta x) \leq x + \Delta x$$

and Theorem 4.5 we deduce that

$$\lim_{\Delta x \to 0^+} z(\Delta x) = x.$$

By similar arguments $\lim_{\Delta x \to 0^-} z(\Delta x) = x$, so $\lim_{\Delta x \to 0} z(\Delta x) = x$. But $f$ is continuous at $x$, hence (4.11) implies

$$\lim_{\Delta x \to 0} f(z(\Delta x)) = f\Big(\lim_{\Delta x \to 0} z(\Delta x)\Big) = f(x).$$

Passing to the limit in (9.23), we find the thesis

$$F'(x) = \lim_{\Delta x \to 0} \frac{F(x + \Delta x) - F(x)}{\Delta x} = f(x).$$

In case $x$ is a boundary point of $I$ it suffices to take one-sided limits instead, and the same conclusion follows. □


Corollary 9.38 Let $F_{x_0}$ be an integral function of a continuous $f$ on $I$. Then

$$F_{x_0}(x) = G(x) - G(x_0), \qquad \forall x \in I$$

for any primitive map $G$ of $f$ on $I$.


Proof. There exists a number $c$ with $F_{x_0}(x) = G(x) - c$, $\forall x \in I$, by Theorem 9.4. The constant is fixed by the condition $F_{x_0}(x_0) = 0$. □


The next corollary has great importance, for it provides the definite integral 
by means of an arbitrary primitive of the integrand. 


Corollary 9.39 Let $f$ be continuous on $[a,b]$ and $G$ any primitive of $f$ on that interval. Then

$$\int_a^b f(x)\,dx = G(b) - G(a). \qquad (9.24)$$

Proof. Denoting $F_a$ the integral map vanishing at $a$, one has

$$\int_a^b f(x)\,dx = F_a(b).$$

The previous corollary proves the claim once we put $x_0 = a$, $x = b$. □


Very often the difference $G(b) - G(a)$ is written as

$$[G(x)]_a^b \quad\text{or}\quad G(x)\Big|_a^b.$$

Examples 9.40
The three integrals below are computed using (9.24).

$$\int_0^1 x^2\,dx = \left[\frac{x^3}{3}\right]_0^1 = \frac13.$$

$$\int_0^{\pi} \sin x\,dx = [-\cos x]_0^{\pi} = 2.$$

$$\int_2^6 \frac{1}{x}\,dx = [\log x]_2^6 = \log 6 - \log 2 = \log 3. \qquad \Box$$
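Both the Fundamental Theorem and formula (9.24) lend themselves to a numerical cross-check. In the sketch below (an illustration, not part of the text) the integral function $F(x) = \int_{x_0}^{x} f(s)\,ds$ is approximated by a composite Simpson rule; its difference quotient is then compared with $f(x)$, and three definite integrals are compared with $G(b) - G(a)$. The quadrature rule, step sizes and function names are assumptions of this sketch.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson rule with n (even) subintervals; also works when b < a."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def integral_function(f, x0):
    # Numerical stand-in for F(x) = integral of f from x0 to x.
    return lambda x: simpson(f, x0, x)

# Difference quotient of F versus f, here with f = cos and x0 = 0.
F = integral_function(math.cos, 0.0)
x, dx = 0.7, 1e-4
print((F(x + dx) - F(x - dx)) / (2 * dx), math.cos(x))

# Formula (9.24) with explicit primitives.
print(simpson(lambda t: t * t, 0.0, 1.0), 1 / 3)
print(simpson(math.sin, 0.0, math.pi), 2.0)
print(simpson(lambda t: 1 / t, 2.0, 6.0), math.log(3))
```

Each printed pair should agree to many digits; the quadrature is of course no substitute for the primitive, it only confirms the arithmetic.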




Remark 9.41 There is a generalisation of the Fundamental Theorem of integral calculus to piecewise-continuous maps, which goes like this. If $f$ is piecewise-continuous on all closed and bounded subintervals of $I$, then any integral function $F$ on $I$ is continuous on $I$, it is differentiable at all points where $f$ is continuous, and $F'(x) = f(x)$. Jump discontinuities for $f$ inside $I$ correspond to corner points for $F$.

The integral function $F$ is called then a generalised primitive of $f$ on $I$. □


Now we present an integral representation of a differentiable map, which turns 
out to be useful in many circumstances. 


Corollary 9.42 Given $f$ differentiable on $I$ with continuous first derivative, and any $x_0 \in I$,

$$f(x) = f(x_0) + \int_{x_0}^{x} f'(s)\,ds. \qquad (9.25)$$

Proof. Obviously $f$ is a primitive of its own derivative, so (9.24) gives

$$\int_{x_0}^{x} f'(s)\,ds = f(x) - f(x_0),$$

whence the result follows. □


We illustrate this result by providing two applications. The first one is the justification of the Maclaurin expansion of $f(x) = \arcsin x$ and $f(x) = \arctan x$. First though, a technical lemma.


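Lemma 9.43 If $\varphi$ is a continuous map around 0 such that $\varphi(x) = o(x^\alpha)$ for $x \to 0$, and $\alpha \geq 0$, then the primitive $\psi(x) = \int_0^x \varphi(s)\,ds$ satisfies $\psi(x) = o(x^{\alpha+1})$ as $x \to 0$. This can be written as

$$\int_0^x o(s^\alpha)\,ds = o(x^{\alpha+1}) \quad\text{for } x \to 0. \qquad (9.26)$$

Proof. From de l'Hôpital's Theorem 6.41,

$$\lim_{x \to 0} \frac{\psi(x)}{x^{\alpha+1}} = \lim_{x \to 0} \frac{\psi'(x)}{(\alpha+1)x^{\alpha}} = \frac{1}{\alpha+1}\lim_{x \to 0} \frac{\varphi(x)}{x^{\alpha}} = 0. \qquad \Box$$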
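So now take $f(x) = \arctan x$. As its derivative reads $f'(x) = \dfrac{1}{1+x^2}$, (9.25) allows us to write

$$\arctan x = \int_0^x \frac{1}{1+s^2}\,ds.$$

The Maclaurin expansion of $f'(s)$, obtained from (7.18) changing $x = s^2$, reads

$$\frac{1}{1+s^2} = 1 - s^2 + s^4 - \ldots + (-1)^m s^{2m} + o(s^{2m+1}) = \sum_{k=0}^{m} (-1)^k s^{2k} + o(s^{2m+1}).$$

Term-by-term integration together with (9.26) yields Maclaurin's expansion for $f(x)$:

$$\arctan x = x - \frac{x^3}{3} + \frac{x^5}{5} - \ldots + (-1)^m \frac{x^{2m+1}}{2m+1} + o(x^{2m+2}) = \sum_{k=0}^{m} (-1)^k \frac{x^{2k+1}}{2k+1} + o(x^{2m+2}).$$

As for the inverse sine, write

$$f(x) = \arcsin x = \int_0^x \frac{1}{\sqrt{1-s^2}}\,ds.$$

Now use (7.17) with $\alpha = -\frac12$ and change $x = -s^2$:

$$\frac{1}{\sqrt{1-s^2}} = 1 + \frac12 s^2 + \frac38 s^4 + \ldots + \left|\binom{-\frac12}{m}\right| s^{2m} + o(s^{2m+1}).$$

Integrating term by term and using (9.26) again,

$$\arcsin x = x + \frac{x^3}{6} + \frac{3x^5}{40} + \ldots + \left|\binom{-\frac12}{m}\right| \frac{x^{2m+1}}{2m+1} + o(x^{2m+2}).$$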

As a second application of Corollary 9.42, we derive a new form for the re- 
mainder in a Taylor formula, which adds to the already known expressions due to 
Peano and Lagrange (recall formulas (7.6) and (7.8)). Such a form, called integral 
form, may provide more accurate information on the behaviour of the error than 
the previous ones, although under a stronger assumption on the function f. The 
proof of this result, that makes use of the Principle of Induction, is given in Ap- 
pendix A.4.4, p. 458, where the reader may also find an example of application of 
the new form. 


338 9 Integral calculus I 


Theorem 9.44 (Taylor formula with integral remainder) Let $n \ge 0$
be an arbitrary integer, $f$ differentiable $n+1$ times around a point $x_0$, with
continuous derivative of order $n+1$. Then

$$f(x) - Tf_{n,x_0}(x) = \frac{1}{n!}\int_{x_0}^{x} f^{(n+1)}(t)\,(x-t)^n\,dt.$$
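As a sanity check of the integral form of the remainder, the sketch below compares both sides of the formula for $f(x) = e^x$, $x_0 = 0$. It is an illustration under stated assumptions: the integral is approximated with a composite Simpson rule, and the helper names (`simpson`, `taylor_poly_exp`, `integral_remainder_exp`) are hypothetical.

```python
import math

def simpson(g, a, b, n=1000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def taylor_poly_exp(x, n):
    """Taylor polynomial of exp of degree n centred at 0."""
    return sum(x ** k / math.factorial(k) for k in range(n + 1))

def integral_remainder_exp(x, n):
    """(1/n!) * integral over [0, x] of e^t (x - t)^n dt."""
    return simpson(lambda t: math.exp(t) * (x - t) ** n, 0.0, x) / math.factorial(n)

x, n = 1.0, 3
print(math.exp(x) - taylor_poly_exp(x, n), integral_remainder_exp(x, n))
```

The two printed numbers should agree up to the quadrature error.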


9.9 Rules of definite integration 


The Fundamental Theorem of integral calculus and the rules that apply to indef- 
inite integrals, presented in Sect. 9.2, furnish similar results for definite integrals. 


Theorem 9.45 (Integration by parts) Let $f$ and $g$ be differentiable with
continuity on $[a,b]$. Then

$$\int_a^b f(x)g'(x)\,dx = \big[f(x)g(x)\big]_a^b - \int_a^b f'(x)g(x)\,dx. \qquad (9.27)$$


Proof. If $H(x)$ denotes any primitive of $f'(x)g(x)$ on $[a,b]$, the known result
on integration by parts prescribes that $f(x)g(x) - H(x)$ is a primitive
of $f(x)g'(x)$. Thus (9.24) implies

$$\int_a^b f(x)g'(x)\,dx = \big[f(x)g(x)\big]_a^b - \big[H(x)\big]_a^b.$$

It then suffices to use (9.24) on the map $f'(x)g(x)$. $\square$


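Theorem 9.46 (Integration by substitution) Let $f(y)$ be continuous on
$[a,b]$. Take a map $\varphi(x)$ from $[\alpha,\beta]$ to $[a,b]$, differentiable with continuity.
Then

$$\int_\alpha^\beta f(\varphi(x))\,\varphi'(x)\,dx = \int_{\varphi(\alpha)}^{\varphi(\beta)} f(y)\,dy. \qquad (9.28)$$

If $\varphi$ bijects $[\alpha,\beta]$ onto $[a,b]$, this formula may be written as

$$\int_a^b f(y)\,dy = \int_{\varphi^{-1}(a)}^{\varphi^{-1}(b)} f(\varphi(x))\,\varphi'(x)\,dx. \qquad (9.29)$$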




Proof. Let $F(y)$ be a primitive of $f(y)$ on $[a,b]$. Formula (9.28) follows from (9.4)
and Corollary 9.39. When $\varphi$ is bijective, the two formulas are equivalent
for $a = \varphi(\alpha)$, $b = \varphi(\beta)$ if $\varphi$ is strictly increasing, and $a = \varphi(\beta)$, $b = \varphi(\alpha)$
if strictly decreasing. $\square$


Both formulas are used in concrete applications. 


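Examples 9.47
i) Compute

$$\int_0^{3\pi/4} \sin^3 x \cos x\,dx.$$

Set $y = \varphi(x) = \sin x$, so that $\varphi'(x) = \cos x$, $\varphi(0) = 0$, $\varphi\big(\tfrac{3\pi}{4}\big) = \tfrac{1}{\sqrt2}$. From (9.28)
we obtain

$$\int_0^{3\pi/4} \sin^3 x \cos x\,dx = \int_0^{1/\sqrt2} y^3\,dy = \left[\frac{y^4}{4}\right]_0^{1/\sqrt2} = \frac{1}{16}.$$

Note $\varphi$ is not injective on $\big[0, \tfrac{3\pi}{4}\big]$.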

ii) To determine

$$S = \int_0^1 \arcsin\sqrt{1-y^2}\,dy,$$

we change $y = \varphi(x) = \cos x$, with $x$ varying in $[0, \tfrac{\pi}{2}]$. On this interval $\varphi$ is strictly
decreasing, hence one-to-one; moreover $\varphi(0) = 1$ and $\varphi(\tfrac{\pi}{2}) = 0$, i.e., $\varphi^{-1}(0) = \tfrac{\pi}{2}$,
$\varphi^{-1}(1) = 0$. Note also

$$\arcsin\sqrt{1-\cos^2 x} = \arcsin\sqrt{\sin^2 x} = \arcsin(\sin x) = x.$$

Formula (9.29) gives

$$S = \int_{\pi/2}^{0} \big(\arcsin\sqrt{1-\cos^2 x}\big)(-\sin x)\,dx = \int_0^{\pi/2} x\sin x\,dx,$$

and eventually we may use (9.27):

$$S = \big[-x\cos x\big]_0^{\pi/2} + \int_0^{\pi/2} \cos x\,dx = \big[\sin x\big]_0^{\pi/2} = 1.$$
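Both substitutions in Examples 9.47 can be verified numerically. The sketch below is illustrative only: it assumes a composite Simpson rule as the quadrature (the helper `simpson` and the variable names are hypothetical) and compares the two sides of each change of variable.

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

# Example i): both sides of (9.28) with y = sin x; the exact value is 1/16.
lhs_i = simpson(lambda x: math.sin(x) ** 3 * math.cos(x), 0.0, 3 * math.pi / 4)
rhs_i = simpson(lambda y: y ** 3, 0.0, 1 / math.sqrt(2))

# Example ii): the original integral and its transformed version; the exact value is 1.
# max(0.0, ...) guards against a tiny negative argument caused by rounding.
lhs_ii = simpson(lambda y: math.asin(math.sqrt(max(0.0, 1 - y * y))), 0.0, 1.0)
rhs_ii = simpson(lambda x: x * math.sin(x), 0.0, math.pi / 2)

print(lhs_i, rhs_i, lhs_ii, rhs_ii)
```

The integrand of `lhs_ii` has an unbounded derivative at $y = 1$, so its quadrature converges more slowly than the transformed integral; this is one practical reason for changing variables.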
0 


Corollary 9.48 Let $f$ be integrable on the interval $[-a,a]$, $a > 0$. If $f$ is an
even map,

$$\int_{-a}^{a} f(x)\,dx = 2\int_0^a f(x)\,dx;$$

if $f$ is odd,

$$\int_{-a}^{a} f(x)\,dx = 0.$$




Proof. Theorem 9.33 i) gives

$$\int_{-a}^{a} f(x)\,dx = \int_{-a}^{0} f(x)\,dx + \int_0^a f(x)\,dx.$$

Substitute $y = \varphi(x) = -x$ in the middle integral:

$$\int_{-a}^{0} f(x)\,dx = -\int_a^0 f(-y)\,dy = \int_0^a f(-y)\,dy.$$

The right-most integral coincides with $\int_0^a f(y)\,dy$ if $f$ is even, with its
opposite when $f$ is odd. The claim follows because the variable of integration
is a mute symbol. $\square$
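A quick numerical illustration of Corollary 9.48 follows. It is a sketch under stated assumptions: the even and odd sample maps $x^2\cos x$ and $x^3\cos x$ are chosen here for illustration, and the quadrature helper `simpson` is hypothetical.

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def even_map(x):
    return x * x * math.cos(x)      # f(-x) = f(x)

def odd_map(x):
    return x ** 3 * math.cos(x)     # f(-x) = -f(x)

a = 2.0
full_even = simpson(even_map, -a, a)
half_even = simpson(even_map, 0.0, a)
full_odd = simpson(odd_map, -a, a)
print(full_even, 2 * half_even, full_odd)
```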


9.9.1 Application: computation of areas 


This first chapter on integrals ends with a few examples of the use of the Funda- 
mental Theorem to determine the area of planar regions. 


i) Suppose we are asked to find the area $A$ of the region enclosed by the graphs
of the maps $y = f(x) = x^2$ and $y = g(x) = \sqrt{x}$, given in Fig. 9.11. The curves
meet at points corresponding to $x = 0$ and $x = 1$, and the region in question can
be seen as the difference between the trapezoidal region of $g$ and the trapezoidal
region of $f$, both over the interval $[0,1]$. Therefore

$$A = \int_0^1 g(x)\,dx - \int_0^1 f(x)\,dx = \int_0^1 \big(\sqrt{x} - x^2\big)\,dx = \left[\frac23 x^{3/2} - \frac13 x^3\right]_0^1 = \frac13.$$

Figure 9.11. Region bounded by the graphs of $f(x) = x^2$ and $g(x) = \sqrt{x}$




Figure 9.12. Area under $y = \sqrt{r^2 - x^2}$ in the first quadrant


ii) In the second example we check the known relation $A(r) = \pi r^2$ for the area
of a disc as a function of its radius $r$. The disc centred at the origin with radius
$r$ is the set of points $(x,y)$ such that $x^2 + y^2 \le r^2$. The quarter of the disc lying in the first quadrant is then the
trapezoidal region of $y = \sqrt{r^2 - x^2}$ relative to $[0,r]$ (Fig. 9.12), so

$$A(r) = 4\int_0^r \sqrt{r^2 - x^2}\,dx.$$

Let us change variables by $x = \varphi(t) = rt$, so that $dx = r\,dt$ and $0 = \varphi(0)$, $r = \varphi(1)$.
Because of (9.29), we have

$$A(r) = 4r^2 \int_0^1 \sqrt{1-t^2}\,dt. \qquad (9.30)$$

From Example 9.13 vi), we already know a primitive of $f(t) = \sqrt{1-t^2}$ is

$$F(t) = \frac12\, t\sqrt{1-t^2} + \frac12 \arcsin t.$$

Therefore

$$A(r) = 4r^2\left[\frac12\, t\sqrt{1-t^2} + \frac12\arcsin t\right]_0^1 = 4r^2\,\frac{\pi}{4} = \pi r^2.$$


iii) We compute the area $A$ of the part of plane bounded by the parabola $y =
f(x) = x(1-x)$ and the line $y = g(x) = -\frac{x}{2}$ (Fig. 9.13, left). The curves
intersect at the origin and at $\big(\frac32, -\frac34\big)$, plus on the interval $\big[0, \frac32\big]$ we have $f(x) \ge
g(x)$. Albeit part of the region overlaps the negative half-plane (where $y < 0$),
the total area can still be calculated by

$$A = \int_0^{3/2} \big(f(x) - g(x)\big)\,dx$$

for the following reason. The number $A$ is also the area of the region bounded
by the graphs of the translated maps $f(x) + \frac34$ and $g(x) + \frac34$; shifting the $x$-axis
vertically so that the origin goes to $\big(0, -\frac34\big)$ does not alter the surface area
(Fig. 9.13, right). So,

$$A = \int_0^{3/2} \Big(\frac32 x - x^2\Big)\,dx = \left[\frac34 x^2 - \frac13 x^3\right]_0^{3/2} = \frac{9}{16}.$$

Figure 9.13. The area bounded by the graphs of $f(x)$ and $g(x)$ is translation-invariant
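The three areas computed in this section can be cross-checked by numerical quadrature. This is a minimal sketch, not a method from the text: the Simpson helper and the choice $r = 2$ for the disc are assumptions made for illustration.

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

# i) region between sqrt(x) and x^2 over [0, 1]; exact value 1/3
area_i = simpson(lambda x: math.sqrt(x) - x * x, 0.0, 1.0)

# ii) disc of radius r via (9.30); exact value pi * r^2
r = 2.0
area_ii = 4 * r * r * simpson(lambda t: math.sqrt(max(0.0, 1 - t * t)), 0.0, 1.0)

# iii) region between x(1 - x) and -x/2 over [0, 3/2]; exact value 9/16
area_iii = simpson(lambda x: x * (1 - x) + x / 2, 0.0, 1.5)

print(area_i, area_ii, area_iii)
```

The first two integrands have an unbounded derivative at an endpoint, so only a few correct digits should be expected there; the third is a polynomial of degree 2, for which Simpson's rule is exact up to rounding.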


9.10 Exercises 


1. Determine the general primitive of : 


a) f(x) =(@ +1)" b) faye ae 
1 2 — sin 


2. Find the primitive map taking the value yo at xo of the functions: 


f(x) = xe? myaa/2 to = 1 
a2 
log x 

e) f(@)=— my =e yo =0 


d) f(x) =cosxe®n* n= 5 yo =e 


3. Compute the indefinite integrals: 


x 
» fame 
el/x* 


[)] feevirear 


4. Compute the indefinite integrals: 


a) / ax” sin x dx 


/ log? x dx 


e) [eo cos xz dx 


5. Compute the indefinite integrals: 


22 
) [sae 
x 
ial 
[)] | x - 4 
v3 — x? 


6. Compute the indefinite integrals: 


2x 


| [| Soa 
1+ 
Oi rercrti 


1 
e) 7 da 
cos x 


7. Compute the indefinite integrals: 


| 
Ol / aaa” 




i 
a | 5— dx 
x log® x 


b) a log 2x da 


: x arctan x dx 


ON ieee 


b) ——_—— 


xe? —5r+6 


17x? — 162 + 60 
d) [Are 


eS = Oe" + 7e+3 
a : 
4)(a —1)? 
1 
d ——d 
— . 
py / SS cos? x 
1 — 2sin? =" 


b) mre 


1 
od) face 






e) [ cost? cde [re V1l+a2dr 
1 1 
——d h ———d 
i/o . »)| fa . 
[sno vas [costxae 
8. Write the primitive of $f(x) = |x|\log(2-x)$ that has a zero at $x = 1$.

9. Find the primitive $F(x)$ of $f(x) = x e^{-|x|}$ with $\lim_{x\to+\infty} F(x) = -5$.

10. What is the primitive, on the interval $(-3,+\infty)$, of

$$f(x) = \frac{x+2}{(|x|+3)(x-3)}$$

that vanishes at $x = 0$?

11. Determine the generalised primitive of

$$f(x) = \begin{cases} 2x^3 - 5x + 3 & \text{if } x \ge 1, \\ 4x - 7 & \text{if } x < 1 \end{cases}$$

vanishing at the origin.

12. Verify that

$$\arctan\frac1x = \frac{\pi}{2} - \arctan x, \qquad \forall x > 0.$$

13. Write the Maclaurin expansion of order 9 for the generic primitive of the map
$f(x) = \cos 2x^2$.

14. Write the Maclaurin expansion of order 4 for the generic primitive of the map

$$f(x) = \frac{2 + e^{-x}}{3 + x^3}.$$


15. Determine the following definite integrals: 


7 1/2 il 
a xcosxdx b ———. dz 
| fj zs 
e2 am /2 1 
] d d ——__———__d 
°) / asl | 4sinx + 3cos x * 


of ae 5] [et 1)ae 


1 


(Recall $[x]$ is the integer part of $x$ and $M(x)$ denotes the mantissa.)




16. Compute the area of the trapezoidal region of $f(x) = |\log x|$ restricted
to $[e^{-1}, e]$.


17. Find the area of the region enclosed by y = f(x) and y = g(x), where: 


a) $f(x) = |x|$, $g(x) = \sqrt{1 - x^2}$


a 
b) f(x) = 2? —2z, g(z) = —z* +2 


18. Determine

$$F(x) = \int_{-1}^{x} \big(|t-1| + 2\big)\,dt.$$


9.10.1 Solutions 


1. Primitive functions: 


a) F(a) = x(x +1) +c; b) F(a) = be5* — be 3* 4.6. 


c) Since we can write

$$\frac{x+1}{x^2+1} = \frac12\,\frac{2x}{x^2+1} + \frac{1}{x^2+1},$$

it follows

$$F(x) = \frac12 \log(x^2+1) + \arctan x + c.$$

d) $F(x) = \log|2x + \cos x| + c$.


2. Primitives: 


a) The general primitive f(x) reads F(x) = nen +c. Imposing F(/2) = 1, we 
get 
1 lL, 


Lage +e whence Gal aTe, 


so the required map is 


b) (ge) = arctan 23+1; c) F(x) = t log? a d) F(a) =e8"*, 


3. Indefinite integrals: 


a) S = $log(z? +7) +e; b) S= 4 (6r+3)% +c. 


c) Changing y = + gives dy = —4 dx, hence 




A) s= a ar +e. 
e) Set $y = 1 + e^x$, so that $dy = e^x\,dx$ and

$$S = \int \sqrt{y}\,dy = \frac23\, y^{3/2} + c = \frac23\,(1 + e^x)^{3/2} + c.$$

f) $S = \sqrt{x^2+7} + c$.


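4. Indefinite integrals:

a) $S = (2 - x^2)\cos x + 2x\sin x + c$; b) $S = \frac13 x^3\big(\log 2x - \frac13\big) + c$.

c) We integrate by parts choosing $f(x) = \log^2 x$ and $g'(x) = 1$. Then $f'(x) = \frac{2}{x}\log x$, $g(x) = x$, giving

$$S = x\log^2 x - 2\int \log x\,dx.$$

The integral on the right requires another integration by parts (as in Example
9.11 ii)) and eventually leads to

$$S = x\log^2 x - 2x(\log x - 1) + c = x(\log^2 x - 2\log x + 2) + c.$$

d) We take $f(x) = \arctan x$, $g'(x) = x$ and integrate by parts. Since $f'(x) = \frac{1}{1+x^2}$
and $g(x) = \frac12 x^2$,

$$S = \frac12 x^2 \arctan x - \frac12\int \frac{x^2}{1+x^2}\,dx = \frac12 x^2\arctan x - \frac12\int\Big(1 - \frac{1}{1+x^2}\Big)dx$$

$$= \frac12 x^2\arctan x - \frac12 x + \frac12\arctan x + c.$$

e) $S = \frac15 e^{2x}(\sin x + 2\cos x) + c$.

f) The remarks made on p. 314 v) suggest to integrate $S_1 = \int \frac{1}{1+x^2}\,dx$ by parts
with $f(x) = \frac{1}{1+x^2}$ and $g'(x) = 1$. Then $f'(x) = -\frac{2x}{(1+x^2)^2}$, $g(x) = x$, and

$$S_1 = \int \frac{1}{1+x^2}\,dx = \frac{x}{1+x^2} + 2\int \frac{x^2}{(1+x^2)^2}\,dx = \frac{x}{1+x^2} + 2\int \frac{x^2+1-1}{(1+x^2)^2}\,dx$$

$$= \frac{x}{1+x^2} + 2\int \frac{1}{1+x^2}\,dx - 2\int \frac{1}{(1+x^2)^2}\,dx.$$

The solution is then

$$S = \int \frac{1}{(1+x^2)^2}\,dx = \frac12\Big(\frac{x}{1+x^2} + S_1\Big) = \frac12\Big(\frac{x}{1+x^2} + \arctan x\Big) + c.$$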




5. Indefinite integrals: 
a) S = 3log|x — 3| — log|z —1| +c. 
b) S = $43 + 2x + 2log |x — 3| — log |a —2| +c. 
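c) Splitting into partial fractions

$$\frac{x}{x^3-1} = \frac{x}{(x-1)(x^2+x+1)} = \frac{A}{x-1} + \frac{Bx+C}{x^2+x+1}$$

yields $A(x^2+x+1) + (Bx+C)(x-1) = x$. Choosing $x = 1$ and $x = 0$ we
find the constants $A = C = \frac13$, while $x = -1$ determines $B = -\frac13$. Therefore

$$\frac{x}{x^3-1} = \frac13\Big(\frac{1}{x-1} - \frac{x-1}{x^2+x+1}\Big) = \frac13\Big(\frac{1}{x-1} - \frac12\,\frac{2x+1-3}{x^2+x+1}\Big)$$

$$= \frac13\Big(\frac{1}{x-1} - \frac12\,\frac{2x+1}{x^2+x+1} + \frac32\,\frac{1}{(x+\frac12)^2 + \frac34}\Big).$$

In conclusion,

$$S = \frac13\Big(\log|x-1| - \frac12\log(x^2+x+1) + \sqrt3\arctan\frac{2}{\sqrt3}\Big(x + \frac12\Big)\Big) + c.$$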


d) S =log(x? + 4) + 3log |x — 2| —5log|x +2|+ Farctan$ +c. 


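e) We search for the undetermined coefficients in

$$\frac{x^4+1}{x^3-x^2} = x + 1 + \frac{x^2+1}{x^2(x-1)} = x + 1 + \frac{A}{x-1} + \frac{B}{x} + \frac{C}{x^2}.$$

Choosing $x = 1$ and $x = 0$ for

$$Ax^2 + (Bx + C)(x-1) = x^2 + 1$$

produces $A = 2$, $C = -1$, while $x = -1$ tells $B = -1$:

$$S = \int\Big(x + 1 + \frac{2}{x-1} - \frac1x - \frac{1}{x^2}\Big)dx = \frac12 x^2 + x + 2\log|x-1| - \log|x| + \frac1x + c.$$

f) The integrand splits as

$$\frac{2x^3 - 2x^2 + 7x + 3}{(x^2+4)(x-1)^2} = \frac{A}{x-1} + \frac{B}{(x-1)^2} + \frac{Cx+D}{x^2+4}.$$

Imposing

$$A(x-1)(x^2+4) + B(x^2+4) + (Cx+D)(x-1)^2 = 2x^3 - 2x^2 + 7x + 3$$

leads to $A = 1$, $B = 2$, $C = 1$, $D = -1$, hence

$$S = \int\Big(\frac{1}{x-1} + \frac{2}{(x-1)^2} + \frac{x-1}{x^2+4}\Big)dx = \log|x-1| - \frac{2}{x-1} + \frac12\log(x^2+4) - \frac12\arctan\frac{x}{2} + c.$$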


6. Indefinite integrals:

a) Put $y = e^x$, then $dy = e^x\,dx$, and

$$S = \int \frac{y}{1+y}\,dy = \int\Big(1 - \frac{1}{1+y}\Big)dy = y - \log|y+1| + c = e^x - \log(e^x + 1) + c.$$


b) S = 4a — Flog |e” —2|-4, +c. 


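c) Changing $t = \tan\frac{x}{2}$ we can write $\cos x = \frac{1-t^2}{1+t^2}$ and $dx = \frac{2}{1+t^2}\,dt$. Then

$$S = 2\int \frac{1}{t^2(1+t^2)}\,dt = 2\int\Big(\frac{1}{t^2} - \frac{1}{1+t^2}\Big)dt = -\frac{2}{t} - 2\arctan t + c = -\frac{2}{\tan\frac{x}{2}} - x + c.$$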
2 1+tan = 
d) $= —————— : S=1 2 
) Tttan= ) ee 1—tan § 


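f) Set $t = \tan x$, so $\sin^2 x = \frac{t^2}{1+t^2}$, $\cos^2 x = \frac{1}{1+t^2}$ and $dx = \frac{1}{1+t^2}\,dt$. From that,

$$S = \int \frac{1}{(1+t)(1-t)(1+t^2)}\,dt = \int\Big(\frac{A}{1+t} + \frac{B}{1-t} + \frac{Ct+D}{1+t^2}\Big)dt.$$

Now evaluate at $t = -1$, $t = 1$, $t = 0$ and $t = 2$ the condition

$$A(1-t)(1+t^2) + B(1+t)(1+t^2) + (Ct+D)(1-t^2) = 1$$

to obtain $A = \frac14$, $B = \frac14$, $C = 0$, $D = \frac12$. Then

$$S = \frac14\log|1+t| - \frac14\log|1-t| + \frac12\arctan t + c = \frac14\log\left|\frac{1+t}{1-t}\right| + \frac12\arctan t + c$$

$$= \frac14\log\left|\frac{\sin x + \cos x}{\cos x - \sin x}\right| + \frac12 x + c.$$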




7. Indefinite integrals: 
a) S=2,/2+2)8-4/2+a+4+¢; b) S=-sq¢gy tc. 
c) With t? = 3 — x we have x = 3 — ¢” and 2tdt = —dz, so 


2t 1 


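d) By definition $\sinh x = \frac{e^x - e^{-x}}{2}$, so $y = e^x$ yields

$$S = \int \frac{2}{y^2-1}\,dy = \int\Big(\frac{1}{y-1} - \frac{1}{y+1}\Big)dy = \log|y-1| - \log|y+1| + c = \log\frac{|e^x - 1|}{e^x + 1} + c.$$

e) $S = \frac14\big(\frac12 e^{2x} - \frac12 e^{-2x} + 2x\big) + c = \frac14\sinh 2x + \frac12 x + c$.

f) Observe $\log\sqrt{1+x^2} = \frac12\log(1+x^2)$. We integrate by parts putting $f(x) =
\log(1+x^2)$, $g'(x) = 1$, so $f'(x) = \frac{2x}{1+x^2}$ and $g(x) = x$. Then

$$S = \frac12\Big(x\log(1+x^2) - 2\int\frac{x^2}{1+x^2}\,dx\Big) = \frac12\Big(x\log(1+x^2) - 2\int\Big(1 - \frac{1}{1+x^2}\Big)dx\Big)$$

$$= \frac12\big(x\log(1+x^2) - 2x + 2\arctan x\big) + c.$$

g) $S = \frac12\big(\log|1+\tan x| - \frac12\log(1+\tan^2 x) + x\big) + c$.

h) Setting $y = e^{4x}$ implies $dy = 4e^{4x}\,dx$, $dx = \frac{1}{4y}\,dy$. Thus

$$S = \frac14\int\frac{1}{y(y+1)}\,dy = \frac14\int\Big(\frac1y - \frac{1}{y+1}\Big)dy = \frac14\big(\log|y| - \log|y+1|\big) + c$$

$$= \frac14\big(4x - \log(e^{4x}+1)\big) + c = x - \frac14\log(e^{4x}+1) + c.$$

i) Because

$$\sin^5 x = \sin x\,\sin^4 x = \sin x\,(1 - \cos^2 x)^2,$$

choosing $y = \cos x$ has the effect that $dy = -\sin x\,dx$ and

$$\int \sin^5 x\,dx = -\int (1-y^2)^2\,dy = \int(-1 + 2y^2 - y^4)\,dy = -y + \frac23 y^3 - \frac15 y^5 + c$$

$$= -\cos x + \frac23\cos^3 x - \frac15\cos^5 x + c.$$

ℓ) Given that $\cos^4 x = \cos x\cos^3 x$, let us integrate by parts with $f(x) = \cos^3 x$
and $g'(x) = \cos x$, implying $f'(x) = -3\sin x\cos^2 x$ and $g(x) = \sin x$. Thus

$$S = \int\cos^4 x\,dx = \sin x\cos^3 x + 3\int\cos^2 x\sin^2 x\,dx = \sin x\cos^3 x + 3\int\cos^2 x\,(1-\cos^2 x)\,dx$$

$$= \sin x\cos^3 x + 3\int\cos^2 x\,dx - 3S.$$

Now recalling Example 9.9 ii),

$$4S = \sin x\cos^3 x + 3\Big(\frac12 x + \frac14\sin 2x\Big) + c.$$

Finally,

$$\int\cos^4 x\,dx = \frac14\sin x\cos^3 x + \frac38 x + \frac{3}{16}\sin 2x + c.$$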


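8. Note that $f(x)$ is defined on $(-\infty, 2)$, where

$$f(x) = \begin{cases} x\log(2-x) & \text{if } 0 \le x < 2, \\ -x\log(2-x) & \text{if } x < 0. \end{cases}$$

In order to find a primitive, compute the integral $\int x\log(2-x)\,dx$ by parts. Put
$g(x) = \log(2-x)$ and $h'(x) = x$, implying $g'(x) = \frac{1}{x-2}$, $h(x) = \frac12 x^2$, and

$$\int x\log(2-x)\,dx = \frac12 x^2\log(2-x) - \frac12\int\frac{x^2}{x-2}\,dx = \frac12 x^2\log(2-x) - \frac12\int\Big(x + 2 + \frac{4}{x-2}\Big)dx$$

$$= \frac12 x^2\log(2-x) - \frac14 x^2 - x - 2\log(2-x) + c.$$

Thus

$$F(x) = \begin{cases} \frac12 x^2\log(2-x) - \frac14 x^2 - x - 2\log(2-x) + c_1 & \text{if } 0 \le x < 2, \\ -\frac12 x^2\log(2-x) + \frac14 x^2 + x + 2\log(2-x) + c_2 & \text{if } x < 0. \end{cases}$$

The constraint $F(1) = 0$ forces $c_1 = \frac54$, and since $F$ must be continuous at $x = 0$
it follows

$$F(0^+) = -2\log 2 + \frac54 = F(0^-) = 2\log 2 + c_2.$$

This gives $c_2 = -4\log 2 + \frac54$, and the primitive is

$$F(x) = \begin{cases} \frac12 x^2\log(2-x) - \frac14 x^2 - x - 2\log(2-x) + \frac54 & \text{if } 0 \le x < 2, \\ -\frac12 x^2\log(2-x) + \frac14 x^2 + x + 2\log(2-x) - 4\log 2 + \frac54 & \text{if } x < 0. \end{cases}$$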




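9. We write, equivalently,

$$f(x) = \begin{cases} x e^{-x} & \text{if } x \ge 0, \\ x e^{x} & \text{if } x < 0. \end{cases}$$

With Example 9.11 i) in mind,

$$F(x) = \begin{cases} -(x+1)e^{-x} + c_1 & \text{if } x \ge 0, \\ (x-1)e^{x} + c_2 & \text{if } x < 0. \end{cases}$$

Continuity at $x = 0$ implies $F(0) = F(0^+) = -1 + c_1 = F(0^-) = -1 + c_2$, so $c_1 = c_2 = c$ and the generic
primitive of $f$ is

$$F(x) = \begin{cases} -(x+1)e^{-x} + c & \text{if } x \ge 0, \\ (x-1)e^{x} + c & \text{if } x < 0, \end{cases}$$

i.e., $F(x) = -(|x|+1)e^{-|x|} + c$. Additionally,

$$\lim_{x\to+\infty} F(x) = \lim_{x\to+\infty}\big(-(x+1)e^{-x} + c\big) = c,$$

meaning that the condition $\lim_{x\to+\infty} F(x) = -5$ holds when $c = -5$. The required
map is

$$F(x) = -(|x|+1)e^{-|x|} - 5.$$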


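10. Integrate the two cases

$$f(x) = \begin{cases} \dfrac{x+2}{(x+3)(x-3)} & \text{if } x \ge 0, \\[2mm] -\dfrac{x+2}{(x-3)^2} & \text{if } -3 < x < 0 \end{cases}$$

separately, that is, determine

$$S_1 = \int\frac{x+2}{(x+3)(x-3)}\,dx \qquad\text{and}\qquad S_2 = \int\frac{x+2}{(x-3)^2}\,dx.$$

These are rational integrands, so we ought to find the partial fractions first. Rather
easily one sees

$$\frac{x+2}{(x+3)(x-3)} = \frac{A}{x+3} + \frac{B}{x-3} = \frac16\Big(\frac{1}{x+3} + \frac{5}{x-3}\Big),$$

$$\frac{x+2}{(x-3)^2} = \frac{A}{x-3} + \frac{B}{(x-3)^2} = \frac{1}{x-3} + \frac{5}{(x-3)^2},$$

whence

$$S_1 = \frac16\big(\log|x+3| + 5\log|x-3|\big) + c_1, \qquad S_2 = \log|x-3| - \frac{5}{x-3} + c_2.$$

A primitive of $f$ has then the form

$$F(x) = \begin{cases} S_1 & \text{if } x \ge 0, \\ -S_2 & \text{if } -3 < x < 0 \end{cases} = \begin{cases} \frac16\big(\log|x+3| + 5\log|x-3|\big) + c_1 & \text{if } x \ge 0, \\[1mm] -\log|x-3| + \dfrac{5}{x-3} + c_2 & \text{if } -3 < x < 0. \end{cases}$$

Continuity and the vanishing at $x = 0$ tell

$$0 = F(0) = F(0^+) = \log 3 + c_1 = F(0^-) = -\log 3 - \frac53 + c_2.$$

Thus $c_1 = -\log 3$, $c_2 = \log 3 + \frac53$, and

$$F(x) = \begin{cases} \frac16\big(\log(x+3) + 5\log|x-3|\big) - \log 3 & \text{if } x \ge 0, \\[1mm] -\log(3-x) + \dfrac{5}{x-3} + \log 3 + \dfrac53 & \text{if } -3 < x < 0. \end{cases}$$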
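11. The generalised primitive $F(x)$ of $f(x)$ should be continuous and satisfy
$F'(x) = f(x)$ at all points where $f(x)$ is continuous, in our case every $x \ne 1$.
Therefore

$$F(x) = \begin{cases} \int (2x^3 - 5x + 3)\,dx & \text{if } x \ge 1, \\ \int (4x - 7)\,dx & \text{if } x < 1 \end{cases} = \begin{cases} \frac12 x^4 - \frac52 x^2 + 3x + c_1 & \text{if } x \ge 1, \\ 2x^2 - 7x + c_2 & \text{if } x < 1; \end{cases}$$

the relation between $c_1, c_2$ derives from imposing continuity at $x = 1$:

$$F(1) = F(1^+) = 1 + c_1 = F(1^-) = -5 + c_2.$$

Thus $c_2 = 6 + c_1$, and writing $c = c_1$,

$$F(x) = \begin{cases} \frac12 x^4 - \frac52 x^2 + 3x + c & \text{if } x \ge 1, \\ 2x^2 - 7x + 6 + c & \text{if } x < 1. \end{cases}$$

Let us demand $F(0) = 6 + c = 0$, i.e., $c = -6$. That implies

$$F(x) = \begin{cases} \frac12 x^4 - \frac52 x^2 + 3x - 6 & \text{if } x \ge 1, \\ 2x^2 - 7x & \text{if } x < 1. \end{cases}$$

Alternatively, notice the required map (cf. Remark 9.41) equals

$$F(x) = \int_0^x f(t)\,dt,$$

from which we may then integrate $f(t)$.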




12. Consider $F(x) = \arctan\frac1x$ and $G(x) = -\arctan x$. As

$$F'(x) = -\frac{1}{1+x^2} = G'(x),$$

$F(x)$ and $G(x)$ are primitives of the same $f(x) = -\frac{1}{1+x^2}$. As such, Proposition 9.3
ensures they differ by a constant $c \in \mathbb{R}$:

$$F(x) = G(x) + c.$$

The value $c = \frac{\pi}{2}$ is a consequence of $F(1) = \frac{\pi}{4}$, $G(1) = -\frac{\pi}{4}$.
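13. The generic primitive for $f$ is like

$$F(x) = c + \int_0^x \cos 2t^2\,dt.$$

By Lemma 9.43, if

$$\cos 2t^2 = 1 - 2t^4 + \frac23 t^8 + o(t^9), \qquad t \to 0,$$

$F$ expands, for $x \to 0$, as

$$F(x) = c + \int_0^x\Big(1 - 2t^4 + \frac23 t^8 + o(t^9)\Big)dt = c + x - \frac25 x^5 + \frac{2}{27} x^9 + o(x^{10}).$$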
0 


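14. As in Exercise 13, write first Maclaurin's polynomial up to degree 3:

$$f(x) = \frac13\,(2 + e^{-x})\Big(1 + \frac{x^3}{3}\Big)^{-1} = \frac13\Big(3 - x + \frac12 x^2 - \frac16 x^3 + o(x^3)\Big)\Big(1 - \frac13 x^3 + o(x^3)\Big)$$

$$= \frac13\Big(3 - x + \frac12 x^2 - \frac76 x^3 + o(x^3)\Big) = 1 - \frac13 x + \frac16 x^2 - \frac{7}{18} x^3 + o(x^3), \qquad x \to 0.$$

Then

$$F(x) = c + \int_0^x f(t)\,dt = c + \int_0^x\Big(1 - \frac13 t + \frac16 t^2 - \frac{7}{18} t^3 + o(t^3)\Big)dt$$

$$= c + x - \frac16 x^2 + \frac{1}{18} x^3 - \frac{7}{72} x^4 + o(x^4), \qquad x \to 0.$$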
15. Definite integrals: 


a) —2; b) Z; c) e7(3e? — 1); d) Zlog6. 




e) Since 
1 ifl<a<2, 
sa < 2 a2 oe Se, 
3 ife=3, 
we have 


2 3 
1 5 


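f) The parabola $y = x^2 - 1$, on $0 \le x \le \sqrt3$, has the following range set:

$$-1 \le x^2 - 1 < 0 \ \text{ for } x \in [0,1), \qquad 0 \le x^2 - 1 < 1 \ \text{ for } x \in [1,\sqrt2), \qquad 1 \le x^2 - 1 < 2 \ \text{ for } x \in [\sqrt2,\sqrt3).$$

Therefore

$$M(x^2 - 1) = \begin{cases} x^2 - 1 + 1 & \text{if } x \in [0,1), \\ x^2 - 1 & \text{if } x \in [1,\sqrt2), \\ x^2 - 1 - 1 & \text{if } x \in [\sqrt2,\sqrt3), \\ 0 & \text{if } x = \sqrt3, \end{cases}$$

and

$$S = \int_0^1 x^2\,dx + \int_1^{\sqrt2}(x^2 - 1)\,dx + \int_{\sqrt2}^{\sqrt3}(x^2 - 2)\,dx = \sqrt2 - \sqrt3 + 1.$$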


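16. As (see Fig. 9.14)

$$|\log x| = \begin{cases} -\log x & \text{if } e^{-1} \le x \le 1, \\ \log x & \text{if } 1 \le x \le e, \end{cases}$$

from Example 9.11 ii) we infer

$$A = \int_{e^{-1}}^{e} |\log x|\,dx = -\int_{e^{-1}}^{1}\log x\,dx + \int_1^e \log x\,dx = -\big[x(\log x - 1)\big]_{e^{-1}}^{1} + \big[x(\log x - 1)\big]_1^{e} = 2 - \frac{2}{e}.$$

Figure 9.14. Trapezoidal region of the function $f(x) = |\log x|$

Figure 9.15. Region of Exercise 17 a)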


17. Computing areas:

a) The region is symmetric with respect to the $y$-axis (Fig. 9.15). Comparing to
the example of Sect. 9.9.1, we can say that the area will be

$$A = 2\int_0^{\sqrt2/2}\big(\sqrt{1-x^2} - x\big)\,dx = 2\left[\frac12 x\sqrt{1-x^2} + \frac12\arcsin x - \frac12 x^2\right]_0^{\sqrt2/2} = \frac{\pi}{4}.$$

The result agrees with the fact that the region is actually one quarter of a disc.

b) 2. 
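18. From

$$|t - 1| = \begin{cases} 1 - t & \text{if } t < 1, \\ t - 1 & \text{if } t \ge 1, \end{cases}$$

we write

$$F(x) = \begin{cases} \displaystyle\int_{-1}^{x}(1 - t + 2)\,dt & \text{if } x < 1, \\[2mm] \displaystyle\int_{-1}^{1}(1 - t + 2)\,dt + \int_1^x (t - 1 + 2)\,dt & \text{if } x \ge 1 \end{cases} = \begin{cases} -\frac12 x^2 + 3x + \frac72 & \text{if } x < 1, \\[1mm] \frac12 x^2 + x + \frac92 & \text{if } x \ge 1. \end{cases}$$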

10 


Integral calculus II 


This second chapter on integral calculus consists roughly of two parts. In the 
first part we give a meaning to the term ‘improper’ integral, and thus extend the 
notion of area to include unbounded regions. The investigation relies on the tools 
developed when discussing limits. 

The remaining part is devoted to the integration of functions of several variables 
along curves, which generalises the results on real intervals of Chap. 9. 


10.1 Improper integrals 


Hitherto integrals have been defined for bounded maps over closed bounded inter- 
vals of the real line. However, several applications induce one to consider unboun- 
ded intervals quite often, or functions tending to infinity. To cover such cases the 
notion of integral, be it Cauchy’s or Riemann’s, must be extended by means of 
limits. 

We begin with improper integrals with unbounded domain of integration, and 
then treat infinite integrands. 


10.1.1 Unbounded domains of integration 


Let $\mathcal{R}_{loc}([a,+\infty))$ be the set of maps defined on the ray $[a,+\infty)$ and integrable
on every closed and bounded subinterval $[a,c]$ of the domain.
Taking $f \in \mathcal{R}_{loc}([a,+\infty))$ we can introduce the integral function

$$F(c) = \int_a^c f(x)\,dx$$

on $[a,+\infty)$. The natural question to answer concerns its behaviour when $c \to +\infty$.


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_10, 
© Springer International Publishing Switzerland 2015 




Definition 10.1 Let $f \in \mathcal{R}_{loc}([a,+\infty))$. We (formally) set

$$\int_a^{+\infty} f(x)\,dx = \lim_{c\to+\infty}\int_a^c f(x)\,dx.$$

The symbol on the left is called the improper integral of $f$ on $[a,+\infty)$.

i) If the limit exists and is finite, we say that the map $f$ is integrable over
$[a,+\infty)$, or equivalently, that its improper integral converges.

ii) If the limit exists but is infinite, we say that the improper integral of f 
diverges. 

iii) If the limit does not exist, we say that the improper integral is inde- 
terminate. 


The class of integrable maps over $[a,+\infty)$ will be indicated by $\mathcal{R}([a,+\infty))$.
Visualising the improper integral of a positive function is easy. Note first that
the following holds.

Proposition 10.2 Let $f \in \mathcal{R}_{loc}([a,+\infty))$ be such that $f(x) \ge 0$, for all
$x \in [a,+\infty)$. Then the integral map $F(c)$ is increasing on $[a,+\infty)$.


Proof. Take $c_1, c_2 \in [a,+\infty)$ with $c_1 < c_2$. By the property of additivity of the
domain of integration (Theorem 9.33, i)),

$$F(c_2) = \int_a^{c_2} f(x)\,dx = \int_a^{c_1} f(x)\,dx + \int_{c_1}^{c_2} f(x)\,dx = F(c_1) + \int_{c_1}^{c_2} f(x)\,dx.$$

The last integral is $\ge 0$ by Theorem 9.33, iii). Therefore $F(c_2) \ge F(c_1)$. $\square$

Corollary 10.3 The improper integral of a positive map belonging to
$\mathcal{R}_{loc}([a,+\infty))$ is either convergent or divergent to $+\infty$.

Proof. This descends from the proposition by applying Theorem 3.27 to $F$. $\square$


Going back to the geometric picture, we can say that the improper integral of
a positive function represents the area of the trapezoidal region of $f$ over $[a,+\infty)$
(Fig. 10.1). This region is unbounded and may be viewed as the limit, for $c \to +\infty$, of
the regions defined over the subintervals $[a,c]$. The area of the trapezoidal region
over the entire domain of integration $[a,+\infty)$ is finite if the improper integral
converges, and one says that the area is infinite when the integral is divergent.

Figure 10.1. Trapezoidal region of $f$ over the unbounded interval $[a,+\infty)$


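Examples 10.4

i) We consider the integral over $[1,+\infty)$ of the family of functions $f(x) = \frac{1}{x^\alpha}$
for various $\alpha > 0$. Since

$$\int_1^c \frac{1}{x^\alpha}\,dx = \begin{cases} \left[\dfrac{x^{1-\alpha}}{1-\alpha}\right]_1^c = \dfrac{c^{1-\alpha} - 1}{1-\alpha} & \text{if } \alpha \ne 1, \\[2mm] \big[\log x\big]_1^c = \log c & \text{if } \alpha = 1, \end{cases}$$

when $\alpha \ne 1$, one has

$$\int_1^{+\infty}\frac{1}{x^\alpha}\,dx = \lim_{c\to+\infty}\frac{c^{1-\alpha} - 1}{1-\alpha} = \begin{cases} \dfrac{1}{\alpha - 1} & \text{if } \alpha > 1, \\[1mm] +\infty & \text{if } \alpha < 1. \end{cases}$$

If $\alpha = 1$ instead,

$$\int_1^{+\infty}\frac1x\,dx = \lim_{c\to+\infty}\log c = +\infty.$$

The integral behaves in the same manner whichever the lower limit of integration
$a > 0$. Therefore

$$\int_a^{+\infty}\frac{1}{x^\alpha}\,dx \quad \begin{cases} \text{converges} & \text{if } \alpha > 1, \\ \text{diverges} & \text{if } \alpha \le 1. \end{cases}$$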


ii) Let $f(x) = \cos x$. The integral

$$F(c) = \int_0^c \cos x\,dx = \sin c$$

does not admit a limit for $c \to +\infty$, hence $\int_0^{+\infty}\cos x\,dx$ is indeterminate.
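The behaviour found in Example 10.4 i) can be observed by evaluating the integral function at increasing values of $c$. The sketch below simply codes the primitive computed above; the function name `F` and the sample values of $\alpha$ and $c$ are choices made here for illustration.

```python
import math

def F(c: float, alpha: float) -> float:
    """Integral of x**(-alpha) over [1, c], using the primitive of Example 10.4 i)."""
    if alpha == 1:
        return math.log(c)
    return (c ** (1 - alpha) - 1) / (1 - alpha)

# alpha = 2 approaches 1/(alpha - 1) = 1; alpha = 1 and alpha = 0.5 keep growing.
for alpha in (0.5, 1, 2):
    print(alpha, [round(F(10.0 ** k, alpha), 4) for k in (1, 3, 6)])
```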


Improper integrals inherit some features of definite integrals. To be precise, if
$f, g$ belong to $\mathcal{R}([a,+\infty))$:

i) for any $c > a$

$$\int_a^{+\infty} f(x)\,dx = \int_a^c f(x)\,dx + \int_c^{+\infty} f(x)\,dx;$$

ii) for any $\alpha, \beta \in \mathbb{R}$

$$\int_a^{+\infty}\big(\alpha f(x) + \beta g(x)\big)\,dx = \alpha\int_a^{+\infty} f(x)\,dx + \beta\int_a^{+\infty} g(x)\,dx;$$

iii) supposing $f \ge 0$ on $[a,+\infty)$ then

$$\int_a^{+\infty} f(x)\,dx \ge 0.$$

All are consequences of properties i)-iii) in Theorem 9.33 and the properties of
limits.


Convergence criteria 

The integrability of $f \in \mathcal{R}_{loc}([a,+\infty))$ cannot always be established using just the
definition. Indeed, we may not be able to find an integral function $F(c)$ explicitly.
Thus, it becomes all the more important to have other ways to decide about convergence.
When the integral is convergent, computing it might require techniques
that are too sophisticated for this textbook, and which will not be discussed.
The first convergence test we present concerns positive functions.


Theorem 10.5 (Comparison test) Let $f, g \in \mathcal{R}_{loc}([a,+\infty))$ be such that
$0 \le f(x) \le g(x)$ for all $x \in [a,+\infty)$. Then

$$0 \le \int_a^{+\infty} f(x)\,dx \le \int_a^{+\infty} g(x)\,dx. \qquad (10.1)$$

In particular,

i) if the integral of $g$ converges, so does the integral of $f$;
ii) if the integral of $f$ diverges, then the integral of $g$ diverges, too.

Proof. The definite integral is monotone, and using $0 \le f(x) \le g(x)$ over $[a,+\infty)$,
we have

$$F(c) = \int_a^c f(x)\,dx \le \int_a^c g(x)\,dx = G(c).$$

By Corollary 10.3 the maps $F(c)$ and $G(c)$ admit limit for $c \to +\infty$;
comparing the limits, with the help of Corollary 4.4, we obtain

$$0 \le \lim_{c\to+\infty} F(c) \le \lim_{c\to+\infty} G(c),$$

which is (10.1). The statements i) and ii) are straightforward consequences
of (10.1). $\square$




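Example 10.6
Discuss the convergence of the integrals

$$\int_1^{+\infty}\frac{\arctan x}{x^2}\,dx \qquad\text{and}\qquad \int_1^{+\infty}\frac{\arctan x}{x}\,dx.$$

For all $x \in [1,+\infty)$

$$\frac{\pi}{4} \le \arctan x < \frac{\pi}{2},$$

so

$$\frac{\arctan x}{x^2} < \frac{\pi}{2x^2} \qquad\text{and}\qquad \frac{\pi}{4x} \le \frac{\arctan x}{x}.$$

Therefore

$$\int_1^{+\infty}\frac{\arctan x}{x^2}\,dx \le \int_1^{+\infty}\frac{\pi}{2x^2}\,dx \qquad\text{and}\qquad \int_1^{+\infty}\frac{\pi}{4x}\,dx \le \int_1^{+\infty}\frac{\arctan x}{x}\,dx.$$

From Example 10.4 we know $\int_1^{+\infty}\frac{\pi}{2x^2}\,dx$ converges, whereas $\int_1^{+\infty}\frac{\pi}{4x}\,dx$ diverges.
Because of Theorem 10.5, the implication of i) ensures that the integral
$\int_1^{+\infty}\frac{\arctan x}{x^2}\,dx$ converges, while ii) makes $\int_1^{+\infty}\frac{\arctan x}{x}\,dx$ diverge.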
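The two bounds used in Example 10.6 can be observed on a truncated interval $[1,c]$. The following is only a numerical sketch: the cut-off $c = 100$, the Simpson helper and the variable names are assumptions made here, and a finite computation of course does not prove convergence or divergence.

```python
import math

def simpson(g, a, b, n=100_000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

c = 100.0
conv_part = simpson(lambda x: math.atan(x) / x ** 2, 1.0, c)
div_part = simpson(lambda x: math.atan(x) / x, 1.0, c)

upper_bound = (math.pi / 2) * (1 - 1 / c)    # integral of pi/(2 x^2) over [1, c]
lower_bound = (math.pi / 4) * math.log(c)    # integral of pi/(4 x) over [1, c]
print(conv_part, upper_bound, div_part, lower_bound)
```

The first pair stays bounded as $c$ grows, while the lower bound of the second pair grows like $\log c$.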


When the integrand has no fixed sign, we can rely on this criterion. 


Theorem 10.7 (Absolute convergence test) Suppose $f \in \mathcal{R}_{loc}([a,+\infty))$
is such that $|f| \in \mathcal{R}([a,+\infty))$. Then $f \in \mathcal{R}([a,+\infty))$, and moreover

$$\left|\int_a^{+\infty} f(x)\,dx\right| \le \int_a^{+\infty} |f(x)|\,dx.$$

Proof. We introduce $f_+$ and $f_-$, respectively called positive and negative part
of $f$, as follows:

$$f_+(x) = \max(f(x), 0) = \begin{cases} f(x) & \text{if } f(x) \ge 0, \\ 0 & \text{if } f(x) < 0, \end{cases} \qquad f_-(x) = \max(-f(x), 0) = \begin{cases} 0 & \text{if } f(x) \ge 0, \\ -f(x) & \text{if } f(x) < 0. \end{cases}$$

Both are non-negative, and allow to decompose $f$, $|f|$:

$$f(x) = f_+(x) - f_-(x), \qquad |f(x)| = f_+(x) + f_-(x) \qquad (10.2)$$

(see Fig. 10.2). Adding and subtracting these relations leads to

$$f_+(x) = \frac{|f(x)| + f(x)}{2}, \qquad f_-(x) = \frac{|f(x)| - f(x)}{2},$$

which, together with Theorem 9.33 ii), imply $f_+, f_- \in \mathcal{R}_{loc}([a,+\infty))$.
Since $0 \le f_+(x), f_-(x) \le |f(x)|$ for any $x \ge a$, the Comparison test 10.5
yields that $f_+$ and $f_-$ are integrable over $[a,+\infty)$. The first of (10.2) tells
that $f$ is integrable as well.

Eventually, property v) of Theorem 9.33 implies, for all $c > a$,

$$\left|\int_a^c f(x)\,dx\right| \le \int_a^c |f(x)|\,dx.$$

Passing to the limit $c \to +\infty$ proves the claim. $\square$

Figure 10.2. Graphs of a map $f$ (left), its positive part (centre) and negative part (right)


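Example 10.8

Let us consider the integral

$$\int_1^{+\infty}\frac{\cos x}{x^2}\,dx.$$

Since $\left|\frac{\cos x}{x^2}\right| \le \frac{1}{x^2}$, the function $|f(x)| = \left|\frac{\cos x}{x^2}\right|$ is integrable on $[1,+\infty)$ by
Theorem 10.5 and Example 10.4. The above test guarantees integrability, and

$$\left|\int_1^{+\infty}\frac{\cos x}{x^2}\,dx\right| \le \int_1^{+\infty}\left|\frac{\cos x}{x^2}\right|dx \le \int_1^{+\infty}\frac{1}{x^2}\,dx = 1.$$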


Remark 10.9 The Absolute convergence test is a sufficient condition for integrability,
not a necessary one. This is clarified by the following example:

$$\int_1^{+\infty}\frac{\sin x}{x}\,dx \ \text{ converges, but } \ \int_1^{+\infty}\left|\frac{\sin x}{x}\right|dx \ \text{ diverges.} \qquad (10.3)$$

(For a proof we refer to Appendix A.5.3, p. 470.) $\square$

A map $f$ whose absolute value $|f|$ belongs to $\mathcal{R}([a,+\infty))$ is said to be absolutely
integrable on $[a,+\infty)$.




Another useful result is based on the study of the order of infinitesimal of the
integrand as $x \to +\infty$.

Theorem 10.10 (Asymptotic comparison test) Suppose the function
$f \in \mathcal{R}_{loc}([a,+\infty))$ is infinitesimal of order $\alpha$, for $x \to +\infty$, with respect
to $\varphi(x) = \frac1x$. Then

i) if $\alpha > 1$, $f \in \mathcal{R}([a,+\infty))$;
ii) if $\alpha \le 1$, $\int_a^{+\infty} f(x)\,dx$ diverges.

Proof. See Appendix A.5.3, p. 471.


Examples 10.11

i) Consider

$$\int_1^{+\infty}(\pi - 2\arctan x)\,dx.$$

The map $f(x) = \pi - 2\arctan x$ is infinitesimal of first order for $x \to +\infty$: by de
l'Hôpital's Theorem namely,

$$\lim_{x\to+\infty}\frac{\pi - 2\arctan x}{1/x} = \lim_{x\to+\infty}\frac{2x^2}{1+x^2} = 2.$$

The integral therefore diverges.

ii) Discuss the integral

$$\int_1^{+\infty}\frac{x + \cos x}{x^3 + \sin x}\,dx.$$

As $\cos x = o(x)$, $\sin x = o(x^3)$ for $x \to +\infty$, it follows

$$\frac{x + \cos x}{x^3 + \sin x} \sim \frac{1}{x^2}, \qquad x \to +\infty,$$

and the integral converges.


Let us now consider a family of improper integrals generalising Example 10.4 i). 


Example 10.12

We show how the convergence of

$$\int_2^{+\infty}\frac{1}{x^\alpha(\log x)^\beta}\,dx$$

depends on the values of $\alpha, \beta > 0$.

i) The case $\alpha = 1$ can be tackled by direct integration. Changing variables to
$t = \log x$, one has

$$\int_2^{+\infty}\frac{1}{x(\log x)^\beta}\,dx = \int_{\log 2}^{+\infty}\frac{1}{t^\beta}\,dt,$$

so the integral converges if $\beta > 1$, diverges if $\beta \le 1$.

ii) If $\alpha > 1$, we observe preliminarily that $x \ge 2$ implies $\log x \ge \log 2$ and hence

$$\frac{1}{x^\alpha(\log x)^\beta} \le \frac{1}{x^\alpha(\log 2)^\beta}, \qquad \forall x \ge 2.$$

This is sufficient to conclude that the integral converges irrespective of $\beta$, by the
Comparison test.

iii) When $\alpha < 1$, let us write

$$\frac{1}{x^\alpha(\log x)^\beta} = \frac1x\,\frac{x^{1-\alpha}}{(\log x)^\beta}.$$

The function $\frac{x^{1-\alpha}}{(\log x)^\beta}$ tends to $+\infty$ as $x \to +\infty$, for any $\beta$. There is thus an $M > 0$ such
that

$$\frac{1}{x^\alpha(\log x)^\beta} \ge \frac{M}{x}, \qquad \forall x \ge 2.$$

By comparison the integral diverges.


If $f$ is defined on $[k_0,+\infty)$, it could turn out useful, sometimes, to think of its
value at $x = k$ as the general term $a_k$ of a series. Under appropriate assumptions
then, we can relate the behaviour of the series with that of the integral of $f$ over
$[k_0,+\infty)$, as shown hereby (a proof of this result may be found in Appendix A.5.3,
p. 472).

Theorem 10.13 (Integral test) Let $f$ be continuous, positive and decreasing
on $[k_0,+\infty)$, for $k_0 \in \mathbb{N}$. Then

$$\sum_{k=k_0+1}^{\infty} f(k) \le \int_{k_0}^{+\infty} f(x)\,dx \le \sum_{k=k_0}^{\infty} f(k), \qquad (10.4)$$

therefore the integral and the series share the same behaviour. Precisely:

a) $\int_{k_0}^{+\infty} f(x)\,dx$ converges $\iff$ $\sum_{k=k_0}^{\infty} f(k)$ converges;

b) $\int_{k_0}^{+\infty} f(x)\,dx$ diverges $\iff$ $\sum_{k=k_0}^{\infty} f(k)$ diverges.
ko =p 


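Examples 10.14

i) The previous criterion tells for which values of the parameter $\alpha$ the generalised
harmonic series

$$\sum_{k=1}^{\infty}\frac{1}{k^\alpha}$$

converges. Note in fact that $f(x) = \frac{1}{x^\alpha}$, $\alpha > 0$, satisfies the theorem's hypotheses, and
has convergent integral over $[1,+\infty)$ if and only if $\alpha > 1$. Therefore

$$\sum_{k=1}^{\infty}\frac{1}{k^\alpha} \quad \begin{cases} \text{converges} & \text{for } \alpha > 1, \\ \text{diverges} & \text{for } 0 < \alpha \le 1. \end{cases}$$

ii) In order to study

$$\sum_{k=2}^{\infty}\frac{1}{k\log k}$$

we take the map $f(x) = \frac{1}{x\log x}$; its integral over $[2,+\infty)$ diverges, by case i) of
Example 10.12. Then the series $\sum_{k=2}^{\infty}\frac{1}{k\log k}$ is divergent.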
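The mechanism behind (10.4) can be seen on partial sums. The sketch below checks a finite analogue of the inequality, namely $\sum_{k=k_0+1}^{N} f(k) \le \int_{k_0}^{N} f(x)\,dx \le \sum_{k=k_0}^{N-1} f(k)$, which holds for decreasing positive $f$; this truncated form, the helper name `integral_test_bounds` and the value $N = 1000$ are assumptions introduced here, not statements from the text.

```python
import math

def integral_test_bounds(f, integral, k0, N):
    """Return (lower sum, integral over [k0, N], upper sum) for a decreasing positive f."""
    lower = sum(f(k) for k in range(k0 + 1, N + 1))   # f(k0+1) + ... + f(N)
    upper = sum(f(k) for k in range(k0, N))           # f(k0) + ... + f(N-1)
    return lower, integral(k0, N), upper

# f(x) = 1/x^2 with its exact integral 1/a - 1/b over [a, b]
print(integral_test_bounds(lambda x: 1 / x ** 2, lambda a, b: 1 / a - 1 / b, 1, 1000))

# f(x) = 1/(x log x) with exact integral log(log b) - log(log a) over [a, b]
print(integral_test_bounds(lambda x: 1 / (x * math.log(x)),
                           lambda a, b: math.log(math.log(b)) - math.log(math.log(a)),
                           2, 1000))
```

In the first case all three quantities stay bounded as $N$ grows; in the second the middle term grows like $\log\log N$, dragging the upper sum with it.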


A last remark to say that an integral can be defined over $(-\infty, b]$ by putting

$$\int_{-\infty}^{b} f(x)\,dx = \lim_{c\to-\infty}\int_c^b f(x)\,dx.$$

All properties and convergence results easily adapt.


10.1.2 Unbounded integrands 


Consider the set $\mathcal{R}_{loc}([a,b))$ of functions defined on the bounded interval $[a,b)$ and
integrable over each closed subinterval $[a,c]$, $a < c < b$.
If $f \in \mathcal{R}_{loc}([a,b))$ the integral function

$$F(c) = \int_a^c f(x)\,dx$$

is thus defined over $[a,b)$. We wish to study the limiting behaviour of such, for
$c \to b^-$.


Definition 10.15 Let $f \in \mathcal{R}_{loc}([a,b))$ and define, formally,

$$\int_a^b f(x)\,dx = \lim_{c\to b^-}\int_a^c f(x)\,dx; \qquad (10.5)$$

as before, the left-hand side is called improper integral of $f$ over $[a,b)$.

i) If the limit exists and is finite, one says $f$ is (improperly) integrable on
$[a,b)$, or that its improper integral converges.

ii) If the limit exists but is infinite, one says that the improper integral of $f$
is divergent.

iii) If the limit does not exist, one says that the improper integral is indeterminate.

As usual, integrable functions over $[a,b)$ shall be denoted by $\mathcal{R}([a,b))$.


If a map is bounded and integrable on $[a,b]$ (according to Cauchy or Riemann),
it is also integrable on $[a,b)$ in the above sense. Its improper integral coincides with
the definite integral. Indeed, letting $M = \sup_{x\in[a,b]} |f(x)|$, we have

$$\left|\int_a^b f(x)\,dx - \int_a^c f(x)\,dx\right| = \left|\int_c^b f(x)\,dx\right| \le \int_c^b |f(x)|\,dx \le M(b - c).$$

In the limit for $c \to b^-$ we obtain (10.5). This is why the symbol is the same
for definite and improper integrals. At the same time, (10.5) explains that the
concept of improper integral over a bounded domain is especially relevant when
the integrand is infinite in the neighbourhood of the point $b$.


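Example 10.16

Take $f(x) = \frac{1}{(b-x)^\alpha}$ with $\alpha > 0$ (Fig. 10.3 shows one choice of the parameter),
and study its integral over $[a,b)$:

$$\int_a^c \frac{1}{(b-x)^\alpha}\,dx = \begin{cases} \left[\dfrac{(b-x)^{1-\alpha}}{\alpha - 1}\right]_a^c = \dfrac{(b-c)^{1-\alpha} - (b-a)^{1-\alpha}}{\alpha - 1} & \text{if } \alpha \ne 1, \\[2mm] \big[-\log(b-x)\big]_a^c = \log\dfrac{b-a}{b-c} & \text{if } \alpha = 1. \end{cases}$$

When $\alpha \ne 1$,

$$\int_a^b \frac{1}{(b-x)^\alpha}\,dx = \lim_{c\to b^-}\frac{(b-c)^{1-\alpha} - (b-a)^{1-\alpha}}{\alpha - 1} = \begin{cases} \dfrac{(b-a)^{1-\alpha}}{1-\alpha} & \text{if } \alpha < 1, \\[1mm] +\infty & \text{if } \alpha > 1. \end{cases}$$

For $\alpha = 1$,

$$\int_a^b \frac{1}{b-x}\,dx = \lim_{c\to b^-}\log\frac{b-a}{b-c} = +\infty.$$

Therefore

$$\int_a^b \frac{1}{(b-x)^\alpha}\,dx \quad \begin{cases} \text{converges} & \text{if } \alpha < 1, \\ \text{diverges} & \text{if } \alpha \ge 1. \end{cases}$$

Figure 10.3. Trapezoidal region of the unbounded map $f(x) = \frac{1}{(b-x)^\alpha}$, for one choice of $\alpha$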
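The dichotomy of Example 10.16 shows up when the primitive is evaluated at points $c$ approaching $b$. A minimal sketch follows, with $a = 0$, $b = 1$ and the function name `F_bounded` chosen here for illustration.

```python
import math

def F_bounded(c: float, a: float, b: float, alpha: float) -> float:
    """Integral of (b - x)**(-alpha) over [a, c], a <= c < b, from Example 10.16."""
    if alpha == 1:
        return math.log((b - a) / (b - c))
    return ((b - c) ** (1 - alpha) - (b - a) ** (1 - alpha)) / (alpha - 1)

a, b = 0.0, 1.0
for alpha in (0.5, 1, 2):
    # c = b - 10^(-k): values settle near (b-a)^(1-alpha)/(1-alpha) = 2 only for alpha = 0.5
    print(alpha, [F_bounded(b - 10.0 ** (-k), a, b, alpha) for k in (2, 6, 10)])
```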


In analogy to what was seen previously, the integral of a positive $f$ over $[a,b)$ can
be proven to be either convergent or divergent to $+\infty$.


Convergence tests similar to those already mentioned hold in the present situ- 
ation, so we just state a couple of results, without proofs. 


Theorem 10.17 (Comparison test) Let $f, g \in \mathcal{R}_{loc}([a,b))$ be such that
$0 \le f(x) \le g(x)$ for any $x \in [a,b)$. Then

$$0 \le \int_a^b f(x)\,dx \le \int_a^b g(x)\,dx. \qquad (10.6)$$

In particular,

i) if the integral of $g$ converges, the integral of $f$ converges;
ii) if the integral of $f$ diverges, the integral of $g$ diverges.

Theorem 10.18 (Asymptotic comparison test) If $f \in \mathcal{R}_{loc}([a,b))$ is infinite
of order $\alpha$ for $x \to b^-$ with respect to $\varphi(x) = \frac{1}{b-x}$, then

i) if $\alpha < 1$, $f \in \mathcal{R}([a,b))$;
ii) if $\alpha \ge 1$, $\int_a^b f(x)\,dx$ diverges.




Integrals over $(a,b]$ are defined similarly:

$$\int_a^b f(x)\,dx = \lim_{c\to a^+}\int_c^b f(x)\,dx.$$

With the obvious modifications all properties carry over.


Examples 10.19
i) Consider the integral

\[ \int_1^3 \sqrt{\frac{7-x}{3-x}}\,dx. \]

The function $f(x) = \sqrt{\frac{7-x}{3-x}}$ is defined and continuous on $[1,3)$, but has a
discontinuity for $x \to 3^-$. As $7 - x \le 6$ on $[1,3]$, by the Comparison test we have

\[ \int_1^3 \sqrt{\frac{7-x}{3-x}}\,dx \le \sqrt{6} \int_1^3 \frac{dx}{\sqrt{3-x}} < +\infty \]

(recall Example 10.16). The integral therefore converges.


ii) Consider

\[ \int_1^2 \frac{e^x + 1}{(x-1)^2}\,dx. \]

When $x \in (1, 2]$,

\[ \frac{e+1}{(x-1)^2} < \frac{e^x+1}{(x-1)^2}, \]

so by comparison the integral diverges to $+\infty$.


iii) Determine the behaviour of

\[ \int_0^{\pi/2} \frac{\sqrt{x}}{\sin x}\,dx. \]

For $x \to 0^+$, $f(x) = \frac{\sqrt{x}}{\sin x} \sim \frac{1}{\sqrt{x}}$, therefore the integral converges by the
Asymptotic comparison test.

iv) The integral

\[ \int_\pi^4 \frac{\log(x-3)}{x^3 - 8x^2 + 16x}\,dx \]

has integrand f defined on $[\pi, 4)$; f tends to $-\infty$ for $x \to 4^-$ and

\[ f(x) = \frac{\log(1 + (x-4))}{x(x-4)^2} \sim \frac{1}{4(x-4)}, \qquad x \to 4^-. \]

Thus Theorem 10.18 implies divergence to $-\infty$ ($1/(x-4)$ is negative to
the left of $x = 4$). □



10.2 More improper integrals 


Suppose we want to integrate a map with finitely many discontinuities in an interval
I, bounded or not. Subdivide I into a finite number of intervals $I_j$, $j = 1,\ldots,n$,
so that the restricted map falls into one of the cases examined so far (see Fig. 10.4).
Then formally define

\[ \int_I f(x)\,dx = \sum_{j=1}^n \int_{I_j} f(x)\,dx. \]

One says that the improper integral of f on I converges if the integrals on the
right all converge. It is not hard to verify that the improper integral's behaviour
and its value, if convergent, are independent of the chosen partition of I.
Examples 10.20 


i) Suppose we want to study

\[ S = \int_{-\infty}^{+\infty} \frac{1}{1+x^2}\,dx. \]

If we split the real line at the origin we can write

\[ S = \int_{-\infty}^{0} \frac{1}{1+x^2}\,dx + \int_{0}^{+\infty} \frac{1}{1+x^2}\,dx; \]

the two integrals converge, both to $\pi/2$, so $S = \pi$.
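
As a quick sanity check of Example 10.20 i), the two half-line integrals can be evaluated through the primitive arctan of 1/(1 + x^2). The sketch below is an illustration only; the helper name `integral_on` and the truncation points are assumptions, not part of the text.

```python
import math

def integral_on(c1, c2):
    """Integral of 1/(1 + x^2) over [c1, c2], using the primitive arctan."""
    return math.atan(c2) - math.atan(c1)

for c in (10.0, 1e3, 1e6):
    left = integral_on(-c, 0.0)    # approximates the integral over (-inf, 0]
    right = integral_on(0.0, c)    # approximates the integral over [0, +inf)
    print(c, left, right, left + right)
```

As c grows, both halves approach pi/2 and their sum approaches pi.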


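ii) The integrand of

\[ S_1 = \int_0^{+\infty} \frac{\sin x}{x^2}\,dx \]

is infinite at the origin, so we divide the domain into $(0, 1] \cup [1, +\infty)$, obtaining

\[ S_1 = \int_0^1 \frac{\sin x}{x^2}\,dx + \int_1^{+\infty} \frac{\sin x}{x^2}\,dx; \]

but

\[ \frac{\sin x}{x^2} \sim \frac{1}{x} \quad \text{for } x \to 0^+ \qquad \text{and} \qquad \left| \frac{\sin x}{x^2} \right| \le \frac{1}{x^2}, \]

so Theorem 10.18 forces the first integral to diverge, whereas the second converges
by Theorem 10.5. In conclusion $S_1$ tends to $+\infty$.

Figure 10.4. Trapezoidal region of an infinite map, over an unbounded interval
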
For similar reasons 


converges. 


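iii) Let S denote

\[ S = \int_1^6 \frac{x-5}{(x+1)\sqrt[3]{x^2 - 6x + 8}}\,dx. \]

The integrand diverges at $-1$ (which lies outside the domain of integration), at
2 and also at 4. Hence we split the domain so that each subinterval has exactly one
of these two points as an endpoint, for instance

\[ S = \left( \int_1^2 + \int_2^3 + \int_3^4 + \int_4^6 \right) \frac{x-5}{(x+1)\sqrt[3]{x^2 - 6x + 8}}\,dx. \]

The function is infinite of order 1/3 for $x \to 2^\pm$ and also for $x \to 4^\pm$, so the
integral converges.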


10.3 Integrals along curves 


The present and next sections deal with the problem of integrating over a curve, 
rather than just an interval (see Sect. 8.4). The concept of integral along a curve — 
or path integral as it is also known — has its origin in concrete applications, and is 
the first instance we encounter of an integral of a function of several real variables. 

Let $\gamma : [a,b] \to \mathbb{R}^d$ ($d = 2,3$) be a regular arc and $C = \gamma([a,b])$ its image,
called a path. Take $f : \mathrm{dom}\, f \subseteq \mathbb{R}^d \to \mathbb{R}$ a function defined at least on C, hence
such that $C \subseteq \mathrm{dom}\, f$. Suppose moreover that the composite map $f \circ \gamma : [a,b] \to \mathbb{R}$,
defined by $(f \circ \gamma)(t) = f(\gamma(t))$, is continuous on $[a,b]$.


Definition 10.21 The line integral of f along $\gamma$ is the number

\[ \int_\gamma f = \int_a^b f(\gamma(t))\, \|\gamma'(t)\|\,dt, \]   (10.7)

where $\|\gamma'(t)\| = \sqrt{|x'(t)|^2 + |y'(t)|^2 + |z'(t)|^2}$ is the modulus (i.e., the Euclidean
norm) of the vector $\gamma'(t)$. Alternative expressions are 'path integral of
f along $\gamma$', or simply, 'integral of f along $\gamma$'.


The right-hand-side integral in (10.7) is well defined, for the map $f(\gamma(t))\,\|\gamma'(t)\|$
is continuous on $[a,b]$. In fact $\gamma$ is regular by hypothesis, its components' first
derivatives are likewise continuous, and so is the norm $\|\gamma'(t)\|$, by composition. And
recall $f(\gamma(t))$ is continuous from the very beginning.

Integrals along curves have the following interpretation. Let $\gamma$ be a simple arc
in the plane with image C and f a non-negative function on C with graph

\[ \Gamma(f) = \{ (x,y,z) \in \mathbb{R}^3 : (x,y) \in \mathrm{dom}\, f,\ z = f(x,y) \}. \]

By

\[ \Sigma = \{ (x,y,z) \in \mathbb{R}^3 : (x,y) \in C,\ 0 \le z \le f(x,y) \} \]

we indicate the upright-standing surface bounded by C and by its image f(C) lying
on the graph of f, as in Fig. 10.5. One can prove that the value of the integral of
f along $\gamma$ equals the area of $\Sigma$. For example if f is constant on C, say equal to h,
the area of $\Sigma$ is the product of the height h times the base C. Accepting that the
base measures $\ell(C) = \int_a^b \|\gamma'(t)\|\,dt$ (which we shall see in Sect. 10.3.1), we have

\[ \mathrm{Area}(\Sigma) = h\,\ell(C) = \int_a^b f(\gamma(t))\, \|\gamma'(t)\|\,dt = \int_\gamma f. \]


Examples 10.22

i) Let $\gamma : [0,1] \to \mathbb{R}^2$ be the regular arc $\gamma(t) = (t, t^2)$ parametrising the parabola
$y = x^2$ between $O = (0,0)$ and $A = (1,1)$. Then $\gamma'(t) = (1, 2t)$ has length
$\|\gamma'(t)\| = \sqrt{1 + 4t^2}$. If $f : \mathbb{R} \times [0,+\infty) \to \mathbb{R}$ is defined by $f(x,y) = 3x + \sqrt{y}$, the
composition $f \circ \gamma$ reads $f(\gamma(t)) = 3t + \sqrt{t^2} = 4t$ and therefore

\[ \int_\gamma f = \int_0^1 4t \sqrt{1 + 4t^2}\,dt. \]

Figure 10.5. Geometric interpretation of the integral along a curve

Substituting $s = 1 + 4t^2$ we obtain

\[ \int_\gamma f = \frac12 \int_1^5 \sqrt{s}\,ds = \frac12 \left[ \frac23 s^{3/2} \right]_1^5 = \frac13 (5\sqrt5 - 1). \]

ii) The curve $\gamma : [0, 2\pi] \to \mathbb{R}^2$ parametrises the circle centred at (2,1) with
radius 2, $\gamma(t) = (2 + 2\cos t, 1 + 2\sin t)$, so $\|\gamma'(t)\| = \sqrt{4\sin^2 t + 4\cos^2 t} = 2$ for
all t. With the function $f : \mathbb{R}^2 \to \mathbb{R}$, $f(x,y) = (x-2)(y-1) + 1$, we have
$f(\gamma(t)) = 4 \sin t \cos t + 1$, and

\[ \int_\gamma f = 2 \int_0^{2\pi} (4 \sin t \cos t + 1)\,dt = 2 \big[ 2 \sin^2 t + t \big]_0^{2\pi} = 4\pi. \]

If we represent the circle by some $\tilde\gamma$ having the same components as $\gamma$ but t
varying in $[0, 2k\pi]$ (i.e., winding k times), then

\[ \int_{\tilde\gamma} f = 2 \int_0^{2k\pi} (4 \sin t \cos t + 1)\,dt = 4k\pi. \] □
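
Formula (10.7) lends itself to a direct numerical check. The sketch below is an illustration under stated assumptions: the helper names `simpson` and `line_integral` are hypothetical, and composite Simpson quadrature is one arbitrary choice of numerical rule. It recomputes both integrals of Examples 10.22, taking the circle of example ii) with radius 2, i.e. (2 + 2 cos t, 1 + 2 sin t).

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def line_integral(f, curve, velocity, a, b):
    """Approximate the integral of f along a plane curve, as in (10.7)."""
    def integrand(t):
        x, y = curve(t)
        vx, vy = velocity(t)
        return f(x, y) * math.hypot(vx, vy)  # f(gamma(t)) * ||gamma'(t)||
    return simpson(integrand, a, b)

# Example i): parabola y = x^2 with f(x, y) = 3x + sqrt(y)
parabola = line_integral(lambda x, y: 3 * x + math.sqrt(y),
                         lambda t: (t, t * t), lambda t: (1.0, 2 * t), 0.0, 1.0)

# Example ii): circle of radius 2 centred at (2, 1), wound k times
def circle(k):
    return line_integral(lambda x, y: (x - 2) * (y - 1) + 1,
                         lambda t: (2 + 2 * math.cos(t), 1 + 2 * math.sin(t)),
                         lambda t: (-2 * math.sin(t), 2 * math.cos(t)),
                         0.0, 2 * k * math.pi)

print(parabola, (5 * math.sqrt(5) - 1) / 3)
print(circle(1), 4 * math.pi, circle(3), 12 * math.pi)
```

The printed pairs should agree to many digits, and the value for k windings is k times the value for one.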


Example ii) shows that integrals along curves depend not only on the image of the 
curve along which one integrates, but upon the chosen parametric representation 
as well. That said, certain parametrisations give rise to the same integral. 


Definition 10.23 Two regular curves $\gamma : I \to \mathbb{R}^d$, $\delta : J \to \mathbb{R}^d$ are called
equivalent if there is a bijection $\varphi : J \to I$, with continuous and strictly
positive derivative, such that

\[ \delta = \gamma \circ \varphi, \]

i.e., $\delta(\tau) = \gamma(\varphi(\tau))$ for all $\tau \in J$.


Definition 10.24 Let $\gamma : I \to \mathbb{R}^d$ be a regular curve. If $-I$ is the interval
$\{ t \in \mathbb{R} : -t \in I \}$, the curve $-\gamma : -I \to \mathbb{R}^d$ defined by $(-\gamma)(t) = \gamma(-t)$ is
termed opposite to $\gamma$.


Flipping the parameter means we can write $(-\gamma) = \gamma \circ \varphi$, where $\varphi : -I \to I$ is the
bijection $\varphi(t) = -t$ that reverses the orientation of the real line. If $\gamma : [a,b] \to \mathbb{R}^d$
is a regular arc, so is $-\gamma$ over $[-b, -a]$.

It is convenient to call congruent two curves $\gamma$ and $\delta$ that either are equivalent
or one is equivalent to the opposite of the other. In other words, $\delta = \gamma \circ \varphi$ where
$\varphi$ is a strictly monotone bijection of class $C^1$. Since the values of the parameter
play the role of 'tags' for the points on the image C of $\gamma$, all curves congruent to
$\gamma$ still have C as image. Furthermore, a curve congruent to a simple curve obviously
remains simple.


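Let f be a function defined on the image of a regular arc $\gamma : [a,b] \to \mathbb{R}^d$ with
$f \circ \gamma$ continuous, so that the integral of f along $\gamma$ exists. The map $f \circ \delta$ (where
$\delta$ is an arc congruent to $\gamma$) is continuous as well, for it arises by composing a
continuous map between intervals with $f \circ \gamma$.


Proposition 10.25 Let $\gamma : [a,b] \to \mathbb{R}^d$ be a regular arc with image C, f
defined on C such that $f \circ \gamma$ is continuous. Then

\[ \int_\delta f = \int_\gamma f \]

for any arc $\delta$ congruent to $\gamma$.


Proof. Suppose first $\delta = -\gamma$. Then $(-\gamma)'(t) = -\gamma'(-t)$, so norms are preserved, i.e.,
$\|(-\gamma)'(t)\| = \|\gamma'(-t)\|$, and

\[ \int_{-\gamma} f = \int_{-b}^{-a} f\big((-\gamma)(t)\big)\, \|(-\gamma)'(t)\|\,dt = \int_{-b}^{-a} f(\gamma(-t))\, \|\gamma'(-t)\|\,dt. \]

With the change of variables $s = -t$, $ds = -dt$, we obtain

\[ \int_{-\gamma} f = -\int_b^a f(\gamma(s))\, \|\gamma'(s)\|\,ds = \int_a^b f(\gamma(s))\, \|\gamma'(s)\|\,ds = \int_\gamma f. \]

Similarly, if $\delta = \gamma \circ \varphi$, where $\varphi : [c,d] \to [a,b]$, is an equivalent arc to $\gamma$,
then $\delta'(\tau) = \gamma'(\varphi(\tau))\,\varphi'(\tau)$ with $\varphi'(\tau) > 0$. Thus

\[ \int_\delta f = \int_c^d f(\delta(\tau))\, \|\delta'(\tau)\|\,d\tau
= \int_c^d f\big(\gamma(\varphi(\tau))\big)\, \|\gamma'(\varphi(\tau))\,\varphi'(\tau)\|\,d\tau
= \int_c^d f\big(\gamma(\varphi(\tau))\big)\, \|\gamma'(\varphi(\tau))\|\,\varphi'(\tau)\,d\tau. \]

By $t = \varphi(\tau)$, hence $dt = \varphi'(\tau)\,d\tau$, we see that

\[ \int_\delta f = \int_a^b f(\gamma(t))\, \|\gamma'(t)\|\,dt = \int_\gamma f. \] □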




The proposition immediately implies the following result. 


Corollary 10.26 The integral of a function along a curve does not change 


if the curve is replaced by another one, congruent to it. 
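
The invariance stated in Corollary 10.26 can be illustrated numerically. In the sketch below everything is an assumption chosen for illustration: the integrand f(x, y) = x + y, the arc gamma(t) = (t, t^2) on [0, 1], the change of parameter phi(r) = (r + r^2)/2 (an increasing bijection of [0, 1] with positive derivative) and the helper names. It compares the integral along gamma, along the equivalent arc gamma∘phi, and along the opposite arc.

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def line_integral(f, curve, velocity, a, b):
    """Integral of f along a plane curve: f(curve(t)) times the speed."""
    def integrand(t):
        x, y = curve(t)
        vx, vy = velocity(t)
        return f(x, y) * math.hypot(vx, vy)
    return simpson(integrand, a, b)

f = lambda x, y: x + y               # hypothetical integrand
phi = lambda r: (r + r * r) / 2      # hypothetical change of parameter on [0, 1]
dphi = lambda r: (1 + 2 * r) / 2     # its derivative, strictly positive

along_gamma = line_integral(f, lambda t: (t, t * t), lambda t: (1.0, 2 * t), 0.0, 1.0)
# equivalent arc delta = gamma o phi; velocity by the chain rule
along_delta = line_integral(f, lambda r: (phi(r), phi(r) ** 2),
                            lambda r: (dphi(r), 2 * phi(r) * dphi(r)), 0.0, 1.0)
# opposite arc (-gamma)(t) = gamma(-t), defined on [-1, 0]
along_opposite = line_integral(f, lambda t: (-t, t * t), lambda t: (-1.0, 2 * t), -1.0, 0.0)

print(along_gamma, along_delta, along_opposite)
```

The three printed numbers coincide up to quadrature error, as the corollary predicts for congruent arcs.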


Next, let us note that naming c an arbitrary point in (a,b) and setting
$\gamma_1 = \gamma|_{[a,c]}$, $\gamma_2 = \gamma|_{[c,b]}$, we have

\[ \int_\gamma f = \int_{\gamma_1} f + \int_{\gamma_2} f, \]   (10.8)

because integrals are additive with respect to their domain of integration.

Integrating along a curve extends automatically to piecewise-regular arcs. More
precisely, we let $\gamma : [a,b] \to \mathbb{R}^d$ be a piecewise-regular arc and take points
$a = a_0 < a_1 < \ldots < a_n = b$ so that the arcs $\gamma_i = \gamma|_{[a_{i-1}, a_i]}$, $i = 1,\ldots,n$, are regular.
Suppose, as before, that f is a map with domain containing the image C of $\gamma$ and
such that $f \circ \gamma$ is piecewise-continuous on [a,b]. Then we define

\[ \int_\gamma f = \sum_{i=1}^n \int_{\gamma_i} f, \]

coherently with (10.8).


Remark 10.27 Finding an integral along a piecewise-regular arc might be easier
if one uses Corollary 10.26. According to this,

\[ \int_\gamma f = \sum_{i=1}^n \int_{\delta_i} f, \]   (10.9)

where $\delta_i$ are suitable arcs congruent to $\gamma_i$, $i = 1,\ldots,n$, chosen to simplify
computations.


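Example 10.28

We want to calculate $\int_\gamma x^2$, where $\gamma : [0,4] \to \mathbb{R}^2$ is the following parametrisation
of the boundary of the unit square $[0,1] \times [0,1]$:

\[ \gamma(t) = \begin{cases}
\gamma_1(t) = (t, 0) & \text{if } 0 \le t < 1, \\
\gamma_2(t) = (1, t-1) & \text{if } 1 \le t < 2, \\
\gamma_3(t) = (3-t, 1) & \text{if } 2 \le t < 3, \\
\gamma_4(t) = (0, 4-t) & \text{if } 3 \le t \le 4
\end{cases} \]

(see Fig. 10.6, left). Let us represent the four sides by

\[ \begin{array}{lll}
\delta_1(t) = \gamma_1(t) & 0 \le t \le 1, & \delta_1 = \gamma_1, \\
\delta_2(t) = (1, t) & 0 \le t \le 1, & \delta_2 \sim \gamma_2, \\
\delta_3(t) = (t, 1) & 0 \le t \le 1, & \delta_3 \sim -\gamma_3, \\
\delta_4(t) = (0, t) & 0 \le t \le 1, & \delta_4 \sim -\gamma_4
\end{array} \]

(see Fig. 10.6, right).

Figure 10.6. Parametrisation of the unit square, Example 10.28

Then

\[ \int_\gamma x^2 = \int_0^1 t^2\,dt + \int_0^1 1\,dt + \int_0^1 t^2\,dt + \int_0^1 0\,dt = \frac53. \]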
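
The side-by-side computation of Example 10.28 can be reproduced in a few lines. This is a sketch under assumptions: the helper `simpson` and the list `sides` are illustrative names, and each side is parametrised on [0, 1] with unit speed, as the arcs delta_1, ..., delta_4 above.

```python
def simpson(g, a, b, n=200):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

f = lambda x, y: x * x  # the integrand x^2

# the four sides of the unit square, each traversed with speed 1
sides = [lambda t: (t, 0.0), lambda t: (1.0, t), lambda t: (t, 1.0), lambda t: (0.0, t)]

# by Corollary 10.26 the orientation of each side does not matter
pieces = [simpson(lambda t, side=side: f(*side(t)), 0.0, 1.0) for side in sides]
total = sum(pieces)
print(pieces, total)
```

The four contributions are 1/3, 1, 1/3 and 0, adding up to 5/3.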


10.3.1 Length of a curve and arc length 


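The length of a piecewise-regular curve $\gamma : [a,b] \to \mathbb{R}^d$ is, by definition,

\[ \ell(\gamma) = \int_\gamma 1. \]   (10.10)

The origin of the term is once again geometric. A fixed partition
$a = t_0 < t_1 < \ldots < t_{n-1} < t_n = b$ of [a,b] determines points $P_i = \gamma(t_i) \in C$, $i = 0,\ldots,n$. These
in turn give rise to a (possibly degenerate) polygonal path in $\mathbb{R}^d$ whose length is
clearly

\[ \ell(t_0, t_1, \ldots, t_n) = \sum_{i=1}^n \mathrm{dist}(P_{i-1}, P_i), \]

$\mathrm{dist}(P_{i-1}, P_i) = \|P_i - P_{i-1}\|$ being the Euclidean distance of two consecutive
points. If we let $\Delta t_i = t_i - t_{i-1}$, and

\[ \left( \frac{\Delta x}{\Delta t} \right)_i = \frac{x(t_i) - x(t_{i-1})}{t_i - t_{i-1}} \]

(and similarly for the coordinates y and z), then

\[ \|P_i - P_{i-1}\| = \sqrt{ \left( \frac{\Delta x}{\Delta t} \right)_i^2 + \left( \frac{\Delta y}{\Delta t} \right)_i^2 + \left( \frac{\Delta z}{\Delta t} \right)_i^2 }\ \Delta t_i. \]

Therefore

\[ \ell(t_0, t_1, \ldots, t_n) = \sum_{i=1}^n \sqrt{ \left( \frac{\Delta x}{\Delta t} \right)_i^2 + \left( \frac{\Delta y}{\Delta t} \right)_i^2 + \left( \frac{\Delta z}{\Delta t} \right)_i^2 }\ \Delta t_i, \]

which ought to be considered an approximation of the integral appearing in (10.11).
Provided the curve is sufficiently regular (piecewise-regular is enough), one can
indeed prove that the supremum of $\ell(t_0, t_1, \ldots, t_n)$, taken over all possible partitions
of [a,b], is finite and equals $\ell(\gamma)$.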

The length, as of (10.10), depends on the image C of the curve but also on the
parametrisation. The circle $x^2 + y^2 = r^2$, parametrised by $\gamma_1(t) = (r\cos t, r\sin t)$,
$t \in [0, 2\pi]$, has length

\[ \ell(\gamma_1) = \int_0^{2\pi} r\,dt = 2\pi r, \]

a well-known result in elementary geometry. But if we represent it using the curve
$\gamma_2(t) = (r\cos 2t, r\sin 2t)$, $t \in [0, 2\pi]$, we obtain

\[ \ell(\gamma_2) = \int_0^{2\pi} 2r\,dt = 4\pi r, \]

because now the circle winds around twice. Proposition 10.25 says that congruent
curves keep lengths fixed, and it is a fact that the length of a simple curve depends
only on its image C (and not on the parametrisation); it is called the length $\ell(C)$ of
C. In the example, $\gamma_1$ is simple, in contrast to $\gamma_2$; as we have seen, $\ell(C) = \ell(\gamma_1)$.
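
The polygonal approximation described above is easy to try out on the two parametrisations of the circle. The following sketch is illustrative only: the function name `polygonal_length`, the radius r = 1.5 and the use of uniform partitions are assumptions.

```python
import math

def polygonal_length(curve, a, b, n):
    """Length of the polygonal path through curve(t_0), ..., curve(t_n),
    for the uniform partition of [a, b] into n pieces."""
    points = [curve(a + (b - a) * i / n) for i in range(n + 1)]
    return sum(math.dist(p, q) for p, q in zip(points, points[1:]))

r = 1.5  # hypothetical radius
gamma1 = lambda t: (r * math.cos(t), r * math.sin(t))          # winds once
gamma2 = lambda t: (r * math.cos(2 * t), r * math.sin(2 * t))  # winds twice

for n in (10, 100, 1000):
    print(n, polygonal_length(gamma1, 0.0, 2 * math.pi, n),
             polygonal_length(gamma2, 0.0, 2 * math.pi, n))
print(2 * math.pi * r, 4 * math.pi * r)
```

The polygonal lengths increase with n towards 2*pi*r for gamma1 and towards 4*pi*r for gamma2, matching the two integrals above.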


Let now $\gamma$ be a regular curve on the interval I. We fix a point $t_0 \in I$ and define
the map $s : I \to \mathbb{R}$

\[ s(t) = \int_{t_0}^t \|\gamma'(\tau)\|\,d\tau. \]   (10.12)

Recalling (10.11), we have

\[ s(t) = \begin{cases} \ell(\gamma|_{[t_0, t]}) & \text{if } t > t_0, \\ 0 & \text{if } t = t_0, \\ -\ell(\gamma|_{[t, t_0]}) & \text{if } t < t_0. \end{cases} \]

In practice the function s furnishes a reparametrisation of the image of $\gamma$. As a
matter of fact,

\[ s'(t) = \|\gamma'(t)\| > 0, \qquad \forall t \in I, \]

by the Fundamental Theorem of integral calculus and by regularity. Therefore s is
strictly increasing, hence invertible, on I. Letting $J = s(I)$ be the image interval
under s, we denote by $t : J \to I \subseteq \mathbb{R}$ the inverse map to s. Otherwise said, we write
$t = t(s)$ in terms of the new parameter s. The curve $\tilde\gamma : J \to \mathbb{R}^d$, $\tilde\gamma(s) = \gamma(t(s))$,
is equivalent to $\gamma$ (and as such it has the same image C). If $P_1 = \gamma(t_1)$ is a point
on C and $t_1$ corresponds to $s_1$ under the change of variable, then we also have
$P_1 = \tilde\gamma(s_1)$. The number $s_1$ is called arc length of $P_1$.
Differentiating the inverse map,

\[ \tilde\gamma'(s) = \frac{d\tilde\gamma}{ds}(s) = \frac{d\gamma}{dt}(t(s))\, \frac{dt}{ds}(s) = \frac{\gamma'(t)}{\|\gamma'(t)\|}, \]

whence

\[ \|\tilde\gamma'(s)\| = 1, \qquad \forall s \in J. \]

This expresses the fact that the arc length parametrises the motion along a curve
with constant 'speed' 1.


Remark 10.29 Take $\gamma : [a,b] \to \mathbb{R}^d$ a regular curve and let s be the arc length as
in (10.12), with $t_0 = a$; then $s(a) = 0$ and $s(b) = \int_a^b \|\gamma'(\tau)\|\,d\tau = \ell(\gamma)$. Using this
special parameter, we have

\[ \int_\gamma f = \int_{\tilde\gamma} f = \int_0^{\ell(\gamma)} f(\tilde\gamma(s))\,ds. \]

The notion of arc length can be defined to cover in the obvious way piecewise-regular
curves.


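Example 10.30

The curve $\gamma : \mathbb{R} \to \mathbb{R}^3$, $\gamma(t) = (\cos t, \sin t, t)$ describes the circular helix (see
Example 8.8 vi)). Since $\|\gamma'(t)\| = \|(-\sin t, \cos t, 1)\| = (\sin^2 t + \cos^2 t + 1)^{1/2} = \sqrt2$,
choosing $t_0 = 0$ we have

\[ s(t) = \int_0^t \|\gamma'(\tau)\|\,d\tau = \sqrt2 \int_0^t d\tau = \sqrt2\, t. \]

It follows that $t = t(s) = \frac{\sqrt2}{2} s$, $s \in \mathbb{R}$, and the helix can be reparametrised by
arc length

\[ \tilde\gamma(s) = \left( \cos \frac{\sqrt2}{2} s,\ \sin \frac{\sqrt2}{2} s,\ \frac{\sqrt2}{2} s \right). \]
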
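
The unit-speed property of the arc length parametrisation can be checked on the helix of Example 10.30. The sketch below is an illustration, with hypothetical helper names; it estimates the norm of the derivative by central differences, which is an assumption of convenience rather than anything prescribed by the text.

```python
import math

def helix(t):
    """The circular helix gamma(t) = (cos t, sin t, t)."""
    return (math.cos(t), math.sin(t), t)

def helix_by_arc_length(s):
    """Reparametrisation by arc length: t = s / sqrt(2)."""
    return helix(s / math.sqrt(2))

def speed(curve, u, h=1e-6):
    """Central-difference estimate of the norm of the derivative of curve at u."""
    return math.dist(curve(u - h), curve(u + h)) / (2 * h)

for u in (0.0, 1.0, 5.0):
    print(u, speed(helix, u), speed(helix_by_arc_length, u))
```

The first speed is close to sqrt(2) at every point, the second close to 1.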


10.4 Integral vector calculus 
The last section deals with vector fields and their integration, and provides the
correct mathematical framework for basic dynamical concepts such as force fields
and the work done by a force.


Definition 10.31 Let $\Omega$ indicate a non-empty subset of $\mathbb{R}^d$, $d = 2,3$. A
function $F : \Omega \to \mathbb{R}^d$ is called a vector field on $\Omega$.


Conventionally $f_i : \Omega \to \mathbb{R}$, $i = 1,\ldots,d$, are the components of F, written
$F = (f_1, \ldots, f_d)$. Using the unit vectors i, j, k introduced in Sect. 8.2.2, we can
also write $F = f_1 i + f_2 j$ if $d = 2$ and $F = f_1 i + f_2 j + f_3 k$ if $d = 3$.

Vector fields may be integrated along curves, leading to a slightly more general
notion of path integral. Take a regular arc $\gamma : [a,b] \to \mathbb{R}^d$ whose image $C = \gamma([a,b])$
is contained in $\Omega$. In this fashion the composition $F \circ \gamma : t \mapsto F(\gamma(t))$ maps [a,b]
to $\mathbb{R}^d$. We shall assume this composite is continuous, i.e., every $f_i(\gamma(t))$ from [a,b]
to $\mathbb{R}$ is a continuous map. For any $t \in [a,b]$ we denote by

\[ \boldsymbol\tau(t) = \frac{\gamma'(t)}{\|\gamma'(t)\|} \]

the unit tangent vector to C at $P(t) = \gamma(t)$. The scalar function $F_\tau = F \cdot \boldsymbol\tau$,

\[ F_\tau(t) = (F \cdot \boldsymbol\tau)(t) = F(\gamma(t)) \cdot \boldsymbol\tau(t), \]

is the component of the field F along the unit tangent to $\gamma$ at the point $P = \gamma(t)$.


Definition 10.32 The line integral or path integral of F along $\gamma$ is
the integral along the curve $\gamma$ of the map $F_\tau$:

\[ \int_\gamma F \cdot dP = \int_\gamma F_\tau. \]


As the integral on the right equals

\[ \int_\gamma F_\tau = \int_\gamma F \cdot \boldsymbol\tau = \int_a^b F(\gamma(t)) \cdot \boldsymbol\tau(t)\, \|\gamma'(t)\|\,dt = \int_a^b F(\gamma(t)) \cdot \gamma'(t)\,dt, \]

the line integral of F on $\gamma$ reads

\[ \int_\gamma F \cdot dP = \int_a^b F(\gamma(t)) \cdot \gamma'(t)\,dt. \]   (10.13)


Here the physical interpretation is paramount, and throws new light on the
considerations made so far. If F describes a field of forces applied to C, the line integral
becomes the work done by F during motion along the curve. The counterpart to
Proposition 10.25 is


Proposition 10.33 Let $\gamma : [a,b] \to \mathbb{R}^d$ be a regular curve with image C, and
F a vector field over C such that $F \circ \gamma$ is continuous. Then

\[ \int_{-\gamma} F \cdot dP = -\int_\gamma F \cdot dP \qquad \text{and} \qquad \int_\delta F \cdot dP = \int_\gamma F \cdot dP, \]

over any curve $\delta$ equivalent to $\gamma$.


In mechanics this result would tell that the work is done by (resp. against) the 
force if the directions of force and motion are the same (opposite); once a direction 
of motion has been fixed, the work depends only on the path and not on how we 
move along it. 


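Examples 10.34

i) Consider the planar vector field $F : \mathbb{R}^2 \to \mathbb{R}^2$ given by $F(x,y) = (y, x)$. Take
the ellipse $\frac{x^2}{9} + \frac{y^2}{4} = 1$, parametrised by $\gamma : [0, 2\pi] \to \mathbb{R}^2$, $\gamma(t) = (3\cos t, 2\sin t)$.
Then $F(\gamma(t)) = (2\sin t, 3\cos t)$ and $\gamma'(t) = (-3\sin t, 2\cos t)$. Therefore

\[ \int_\gamma F \cdot dP = \int_0^{2\pi} (2\sin t, 3\cos t) \cdot (-3\sin t, 2\cos t)\,dt
= 6 \int_0^{2\pi} (-\sin^2 t + \cos^2 t)\,dt = 6 \int_0^{2\pi} (2\cos^2 t - 1)\,dt
= 12 \int_0^{2\pi} \cos^2 t\,dt - 12\pi = 0, \]

because

\[ \int_0^{2\pi} \cos^2 t\,dt = \left[ \frac12 t + \frac14 \sin 2t \right]_0^{2\pi} = \pi \]

(see Example 9.9 ii)).

ii) Let $F : \mathbb{R}^3 \to \mathbb{R}^3$ be given by $F(x,y,z) = (e^x, x+y, y+z)$, and $\gamma : [0,1] \to \mathbb{R}^3$
by $\gamma(t) = (t, t^2, t^3)$. The vector field along the path reads

\[ F(\gamma(t)) = (e^t, t + t^2, t^2 + t^3) \qquad \text{and} \qquad \gamma'(t) = (1, 2t, 3t^2). \]

Thus

\[ \int_\gamma F \cdot dP = \int_0^1 (e^t, t + t^2, t^2 + t^3) \cdot (1, 2t, 3t^2)\,dt
= \int_0^1 \big[ e^t + 2(t^2 + t^3) + 3(t^4 + t^5) \big]\,dt = e + \frac{19}{15}. \] □
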
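
Formula (10.13) can be checked numerically on both fields of Examples 10.34. The sketch below is illustrative: `simpson` and `work` are hypothetical helper names, and composite Simpson quadrature is an arbitrary choice of rule.

```python
import math

def simpson(g, a, b, n=2000):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

def work(F, curve, velocity, a, b):
    """Line integral of the field F along curve, as in (10.13):
    the integral over [a, b] of F(gamma(t)) . gamma'(t)."""
    def integrand(t):
        return sum(Fi * vi for Fi, vi in zip(F(*curve(t)), velocity(t)))
    return simpson(integrand, a, b)

# Example i): F(x, y) = (y, x) along the ellipse (3 cos t, 2 sin t)
work_ellipse = work(lambda x, y: (y, x),
                    lambda t: (3 * math.cos(t), 2 * math.sin(t)),
                    lambda t: (-3 * math.sin(t), 2 * math.cos(t)),
                    0.0, 2 * math.pi)

# Example ii): F(x, y, z) = (e^x, x + y, y + z) along (t, t^2, t^3)
work_cubic = work(lambda x, y, z: (math.exp(x), x + y, y + z),
                  lambda t: (t, t ** 2, t ** 3),
                  lambda t: (1.0, 2 * t, 3 * t ** 2),
                  0.0, 1.0)

print(work_ellipse, work_cubic, math.e + 19 / 15)
```

The first value is zero up to rounding, and the second agrees with e + 19/15.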


10.5 Exercises 
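1. Check the convergence of the following integrals and compute them explicitly:

a) $\int_0^{+\infty} \frac{1}{x^2 + 3x + 2}\,dx$        b) $\int_0^{+\infty} \frac{x}{(x+1)^3}\,dx$

c) $\int_2^{+\infty} \frac{1}{x\sqrt{x-2}}\,dx$        d) $\int_{-1}^{1} \frac{1}{\sqrt{|x|}\,(x-4)}\,dx$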


2. Discuss the convergence of the improper integrals: 


T° sing 
a dx —5,———- dx 
) | aa/ a b)| [- log?( (2+ log?(2 +e”) 
"i ae = vy 
Cc) | xe” dx [ae 
| 


ee nr 1 
e) | de f) —— dx 
sin 7x 0 vsina 


x — 7/2 _(m@—a)logr © 
coonina yf 
9 coszxzysin xr |log(1 — sin x)| 
3. Study the convergence of

\[ S_n = \int_2^{+\infty} \frac{x}{\sqrt{(x^2+3)^n}}\,dx \]

for varying $n \in \mathbb{N}$. What is the smallest value of n for which $S_n$ converges?


4. Determine a € R such that the integrals below converge: 


ame.) +00 
arctan x 1 
———d b ————.—_———— d 
2) / ke\e e Dye ln? + 5a? + 8x + 4|% a 


—oo 


+00 1 
C el da 
| eo(4+ 9x)? oe a | 


5. For which $\alpha \in \mathbb{R}$ does

\[ \int_2^3 \frac{x\,(\sin(x-2))^\alpha}{\sqrt{x^2 - 4}}\,dx \]

converge? What is its value when $\alpha = 0$?




6. Tell when the following integrals converge: 


‘ | ficseei) toes | = de 
di 0 


7™ _ |] —sinnre 


+00 
1 x—2 i 
—— log ——d ee Eee Ea i 
2 Ye-2 eel a) f sinx — (4 + 27) log(e + z) . 
7. Compute the integral of

\[ f(x,y,z) = \frac{x^2 (1 + 8y)}{\sqrt{1 + y + 4x^2 y}} \]

along the curve $\gamma(t) = (t, t^2, \log t)$, $t \in [1,2]$.


8. Integrate the function f(x,y) = x on the Jordan curve yy whose image consists 
of the parabolic arc of equation y = 4—x? going from A = (—2,0) to C = (2,0), 
and the circle x? + y? = 4 between C and A. 


9. Let $\gamma$ be the curve in the first quadrant with image the union of the segment
from $O = (0,0)$ to $A = (1,0)$, the arc of the ellipse $4x^2 + y^2 = 4$ between
A and $B = (\frac{\sqrt2}{2}, \sqrt2)$, and the segment joining B to the origin. Integrate
$f(x,y) = x + y$ along the simple closed curve $\gamma$.


10. Integrate

\[ f(x,y) = \frac{1}{x^2 + y^2 + 1} \]

along the simple closed curve $\gamma$ which is piecewise-defined by the segment
from O to $A = (\sqrt2, 0)$, the arc of equation $x^2 + y^2 = 2$ lying between A and
$B = (1,1)$, and the segment joining B to the origin.


11. Integrate the vector field $F(x,y) = (x^2, xy)$ along the curve $\gamma(t) = (t^2, t)$,
$t \in [0,1]$.


12. Compute the line integral of the field $F(x,y,z) = (z, y, 2x)$ along
$\gamma(t) = (t, t^2, t^3)$, $t \in [0,1]$.


13. Integrate $F(x,y,z) = (2\sqrt{z}, x, y)$ along $\gamma(t) = (-\sin t, \cos t, t^2)$, $t \in [0, \frac{\pi}{2}]$.


14. Integrate $F(x,y) = (xy^2, x^2 y)$ along the polygonal path $\gamma$ joining, in this
order, the vertices $A = (0,1)$, $B = (1,1)$, $C = (0,2)$, $D = (1,2)$ of a quadrilateral.

15. Integrate $F(x,y) = (0, y)$ along the closed simple curve consisting of the segment
from O to $A = (1,0)$, the arc of circumference $x^2 + y^2 = 1$ between A
and $B = (\frac{\sqrt2}{2}, \frac{\sqrt2}{2})$, and the segment from B back to O.



10.5.1 Solutions 


1. Convergence and computation of integrals: 


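a) $\log 2$; b) $\frac12$.
c) The function $f(x) = \frac{1}{x\sqrt{x-2}}$ is unbounded at $x = 0$ and $x = 2$. The point $x = 0$
lies outside the domain of integration, hence can be ignored. We then split the
integral as

\[ S = \int_2^{+\infty} \frac{1}{x\sqrt{x-2}}\,dx = \int_2^3 \frac{1}{x\sqrt{x-2}}\,dx + \int_3^{+\infty} \frac{1}{x\sqrt{x-2}}\,dx = S_1 + S_2. \]

For $x \to 2^+$, $f(x) \sim \frac{1}{2(x-2)^{1/2}}$, so f is infinite of order $\frac12 < 1$. By the Asymptotic
comparison test, i.e., Theorem 10.18, $S_1$ converges. As for $S_2$, let us consider
f when $x \to +\infty$. Because

\[ f(x) \sim \frac{1}{x \cdot x^{1/2}} = \frac{1}{x^{3/2}}, \qquad x \to +\infty, \]

Theorem 10.10 guarantees $S_2$ converges as well.
To compute the integral, let $t^2 = x - 2$, hence $2t\,dt = dx$ and $x = t^2 + 2$, by
which

\[ S = \int_0^{+\infty} \frac{2}{t^2 + 2}\,dt = \frac{2}{\sqrt2} \left[ \arctan \frac{t}{\sqrt2} \right]_0^{+\infty} = \frac{\sqrt2}{2} \pi. \]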


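d) The integrand is infinite at $x = 0$, $x = 4$. The latter point is irrelevant, for it
does not belong to the domain of integration. At $x = 0$

\[ f(x) \sim -\frac{1}{4\sqrt{|x|}} \qquad \text{for } x \to 0, \]

so the integral converges by applying Theorem 10.18 to

\[ S_1 = \int_{-1}^0 \frac{1}{\sqrt{-x}\,(x-4)}\,dx \qquad \text{and} \qquad S_2 = \int_0^1 \frac{1}{\sqrt{x}\,(x-4)}\,dx \]

separately. For $S_1$, let us change $t^2 = -x$, so $2t\,dt = -dx$ and $x - 4 = -t^2 - 4$.
Then

\[ S_1 = -\int_0^1 \frac{2}{t^2 + 4}\,dt = -\left[ \arctan \frac{t}{2} \right]_0^1 = -\arctan \frac12. \]

Putting $t^2 = x$ in $S_2$,

\[ S_2 = \int_0^1 \frac{2}{t^2 - 4}\,dt = \frac12 \int_0^1 \left( \frac{1}{t-2} - \frac{1}{t+2} \right) dt = \frac12 \left[ \log \left| \frac{t-2}{t+2} \right| \right]_0^1 = -\frac12 \log 3. \]

Therefore $S = S_1 + S_2 = -\left( \arctan \frac12 + \frac12 \log 3 \right)$.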




2. Convergence of improper integrals: 


a) Converges.
b) The map $f(x) = \frac{1}{\log^2(2 + e^x)}$ has $\mathbb{R}$ as domain since $2 + e^x > 2$, $\forall x \in \mathbb{R}$. It is
then sufficient to consider $x \to +\infty$. As

\[ \log(2 + e^x) = \log e^x (1 + 2e^{-x}) = x + \log(1 + 2e^{-x}), \]

it follows

\[ f(x) = \frac{1}{\big( x + \log(1 + 2e^{-x}) \big)^2} \sim \frac{1}{x^2}, \qquad x \to +\infty. \]

The integral converges by Theorem 10.10.


Converges. 


Over the integration domain the map is bounded. Moreover, 


Va >e. 


By the Comparison test (Theorem 10.5), the integral diverges. 


Converges; f) converges. 


TT 


The integrand is not defined at x = 0, 5, nor at 7. For x = 5 though, the 
function admits a continuous prolongation mapping 5 to —1, because if we 
put t = x — 5, then 


T T 
cos x = cos(t + By, = —sint = —sin(x — 5) 
and so _ 
G9 Tv 
vw) = — —v--l, t->-. 
H(z) cos xV/sin © 2 


Therefore the integral is ‘proper’ at « = >. From 


(Oa. — xr —>O0T, (ine LIT, 


2/8 2/n— x’ 


we have convergence by asymptotic comparison (Theorem 10.18). 


The map to be integrated is not defined for 7 = 0, 2 = 3, x = 7. In the limit 
z—>O0t, 


Pein Tw log x _, Flog 
flog — a)? ~ Va 


The map has no well-defined order of infinite with respect to the test function 


+; nevertheless, it is clearly infinite of smaller order than any power a with 
4 < a < l, since the logarithm grows less than + for any q > 0 when 


x — 0*. The Asymptotic comparison test (Theorem 10.18) forces the integral 
to converge around 0. 




About the other points, for z + 4, the function tends to 0, so the integral in 
not improper at 5; when x — 7, we have 


ee (log 7)(a — 2) (log 7) (a — 2) 


a egy ee 1/2 
|log(1+sin(a—7))|!/2 | sin(a — )|!/2 


~ (log 7)(m — x) 


so the integral in x = 7 is proper because f goes to 0. Eventually then, the 
integral always converges. 


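3. The map is defined over all $\mathbb{R}$ with

\[ f(x) \sim \frac{1}{x^{n-1}}, \qquad x \to +\infty. \]

Thus $S_n$ converges if $n - 1 > 1$, i.e., the lowest n for which convergence occurs must
be $n = 3$. Let us find

\[ S_3 = \int_2^{+\infty} \frac{x}{\sqrt{(x^2 + 3)^3}}\,dx \]

then. Define $t = x^2 + 3$, so $dt = 2x\,dx$, and

\[ S_3 = \frac12 \int_7^{+\infty} t^{-3/2}\,dt = \frac{1}{\sqrt7}. \]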


4. Interval of convergence of improper integrals: 


a) a € (1,2). 
b) Having factorised x3 +5274 8ar+4 = (x+2)?(x+1), we can study the function 
for x > +00, r > —2 and z > -1: 


(ane, x — too; 
a ae 
1 
I(@) ~ ope’ ES 23 
1 


In order to ensure convergence, we should impose 3a > 1, 2a < 1 plus a < l. 
Therefore a € (4, 4). 


372 
c) a €(-1,1). 
d) The integrand is infinite at x = 2 and x = 3. But 


1 
ea + +00, 
faces =p x oo 
1 
~ = 2 
fle)~ +, 32, 
1 


M2) ~ aye t—3, 




so everything is fine when x + +00 or x — 3. The point + = 2 is problematic 
only if included in the domain of integration, whence we should have a > 2 to 
guarantee convergence. 

5. $\alpha > -\frac12$ and $S = \sqrt5$.

6. Convergence of improper integrals: 

a) Diverges; b) converges. 


c) Over (2,-+00) the map is not bounded in a neighbourhood of x = 2. Since 


=2 1 
log = ~ log =(@ — 2) 


c+l 3 


= 


is infinite of lower order than any positive power of =, when 2 > Oo it 


follows f is infinite of lesser order than any a > 0). This order, 


1 
(ea ayiare | 
for a suitable choice of a (e.g., @ = $) is smaller than 1 . Therefore the integral 
converges at x = 2. 


For x > +00, 
x—2 3 5 3 
Oe eT os ( x ) , 


x +1 xrt+1 x 
whence 
3 7 3 
72) ae A x — +00 


Altogether, the integral converges. 
d) Let us examine f at x = 0. As 
sina — (x + x2”) log(e+ 2) = x + o(2?) — (24 + 2”) (1 + log (1 + ~)) 
e 
= —2” + o(2”) — (x + 2) (< + o(z)) 


1 
=- (142) 2 +0(0%), x0, 


we have 


pays 


The integral then must diverge at x = 0. 
Studying the behaviour for x — +00 is unnecessary to conclude that the 
integral diverges (albeit a direct computation would establish the same). 


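7. When $t \in [1, 2]$,

\[ f(\gamma(t)) = \frac{t^2 (1 + 8t^2)}{\sqrt{1 + t^2 + 4t^4}}, \qquad \gamma'(t) = \left( 1, 2t, \frac1t \right), \]

whence

\[ \int_\gamma f = \int_1^2 \frac{t^2 (1 + 8t^2)}{\sqrt{1 + t^2 + 4t^4}} \cdot \frac1t \sqrt{1 + t^2 + 4t^4}\,dt = \int_1^2 t (1 + 8t^2)\,dt = \frac{63}{2}. \]


8. 0.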


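9. First of all we find the coordinates of B, intersection in the first quadrant of the
straight line $y = 2x$ and the ellipse $4x^2 + y^2 = 4$, i.e., $B = (\frac{\sqrt2}{2}, \sqrt2)$. The piecewise-regular
curve $\gamma$ can be divided into three regular arcs $\gamma_1$, $\gamma_2$, $\gamma_3$, whose images
are the segment OA, the elliptical path AB and the segment BO respectively. Let
us reparametrise these arcs by calling them:

\[ \begin{array}{lll}
\delta_1(t) = (t, 0) & 0 \le t \le 1, & \delta_1 = \gamma_1, \\
\delta_2(t) = (\cos t, 2\sin t) & 0 \le t \le \frac{\pi}{4}, & \delta_2 \sim \gamma_2, \\
\delta_3(t) = (t, 2t) & 0 \le t \le \frac{\sqrt2}{2}, & \delta_3 \sim -\gamma_3.
\end{array} \]

Then

\[ \int_\gamma f = \int_{\delta_1} f + \int_{\delta_2} f + \int_{\delta_3} f. \]

Since

\[ \begin{array}{lll}
f(\delta_1(t)) = t, & f(\delta_2(t)) = \cos t + 2\sin t, & f(\delta_3(t)) = 3t, \\
\delta_1'(t) = (1, 0), & \delta_2'(t) = (-\sin t, 2\cos t), & \delta_3'(t) = (1, 2), \\
\|\delta_1'(t)\| = 1, & \|\delta_2'(t)\| = \sqrt{\sin^2 t + 4\cos^2 t}, & \|\delta_3'(t)\| = \sqrt5,
\end{array} \]

we have

\[ \int_\gamma f = \int_0^1 t\,dt + \int_0^{\pi/4} (\cos t + 2\sin t) \sqrt{\sin^2 t + 4\cos^2 t}\,dt + \int_0^{\sqrt2/2} 3\sqrt5\, t\,dt \]
\[ = \frac12 + \frac34 \sqrt5 + \int_0^{\pi/4} \cos t \sqrt{4 - 3\sin^2 t}\,dt + 2 \int_0^{\pi/4} \sin t \sqrt{1 + 3\cos^2 t}\,dt
= \frac12 + \frac34 \sqrt5 + I_1 + I_2. \]

To compute $I_1$, put $u = \sqrt3 \sin t$, so $du = \sqrt3 \cos t\,dt$, and

\[ I_1 = \frac{1}{\sqrt3} \int_0^{\sqrt6/2} \sqrt{4 - u^2}\,du. \]

With the substitution $v = \frac{u}{2}$, and recalling Example 9.13 vi),

\[ I_1 = \frac{1}{\sqrt3} \left[ \frac{u}{2} \sqrt{4 - u^2} + 2 \arcsin \frac{u}{2} \right]_0^{\sqrt6/2} = \frac{\sqrt5}{4} + \frac{2}{\sqrt3} \arcsin \frac{\sqrt6}{4}. \]

For $I_2$ the story goes analogously: let $u = \sqrt3 \cos t$, hence $du = -\sqrt3 \sin t\,dt$ and

\[ I_2 = \frac{2}{\sqrt3} \int_{\sqrt6/2}^{\sqrt3} \sqrt{1 + u^2}\,du. \]

By Example 9.13 v), we have

\[ I_2 = \frac{2}{\sqrt3} \left[ \frac12 u \sqrt{1 + u^2} + \frac12 \log \left( \sqrt{1 + u^2} + u \right) \right]_{\sqrt6/2}^{\sqrt3}
= -\frac{\sqrt5}{2} + 2 + \frac{1}{\sqrt3} \left( \log(2 + \sqrt3) - \log \frac{\sqrt{10} + \sqrt6}{2} \right). \]

Overall,

\[ \int_\gamma f = \frac52 + \frac{\sqrt5}{2} + \frac{2}{\sqrt3} \arcsin \frac{\sqrt6}{4} + \frac{1}{\sqrt3} \left( \log(2 + \sqrt3) - \log \frac{\sqrt{10} + \sqrt6}{2} \right). \]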


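10. $2 \arctan \sqrt2 + \frac{\sqrt2}{12} \pi$.
11. Since $F(\gamma(t)) = (t^4, t^3)$ and $\gamma'(t) = (2t, 1)$,

\[ \int_\gamma F \cdot dP = \int_0^1 (t^4, t^3) \cdot (2t, 1)\,dt = \int_0^1 (2t^5 + t^3)\,dt = \frac{7}{12}. \]

12. $\frac94$; 13. $\frac{\pi}{4}$.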


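14. The arc $\gamma$ is piecewise-regular, so we take the regular bits $\gamma_1$, $\gamma_2$, $\gamma_3$ whose
images are the segments AB, BC, CD. Define $\delta_i$, reparametrisation of $\gamma_i$, $\forall i = 1, 2, 3$, by

\[ \begin{array}{lll}
\delta_1(t) = (t, 1) & 0 \le t \le 1, & \delta_1 \sim \gamma_1, \\
\delta_2(t) = (t, 2 - t) & 0 \le t \le 1, & \delta_2 \sim -\gamma_2, \\
\delta_3(t) = (t, 2) & 0 \le t \le 1, & \delta_3 \sim \gamma_3.
\end{array} \]

Since

\[ \begin{array}{lll}
F(\delta_1(t)) = (t, t^2), & F(\delta_2(t)) = \big( t(2-t)^2, t^2(2-t) \big), & F(\delta_3(t)) = (4t, 2t^2), \\
\delta_1'(t) = (1, 0), & \delta_2'(t) = (1, -1), & \delta_3'(t) = (1, 0),
\end{array} \]

one has

\[ \int_\gamma F \cdot dP = \int_{\delta_1} F \cdot dP - \int_{\delta_2} F \cdot dP + \int_{\delta_3} F \cdot dP \]
\[ = \int_0^1 (t, t^2) \cdot (1, 0)\,dt - \int_0^1 \big( t(2-t)^2, t^2(2-t) \big) \cdot (1, -1)\,dt + \int_0^1 (4t, 2t^2) \cdot (1, 0)\,dt = 2. \]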


15. 0. 


11 


Ordinary differential equations 


A large part of the natural phenomena occurring in physics, engineering and other
applied sciences can be described by a mathematical model, a collection of relations
involving a function and its derivatives. The example of uniformly accelerated
motion is typical, the relation being

\[ \frac{d^2 s}{dt^2} = g, \]   (11.1)

where $s = s(t)$ is the motion as a function of time t, and g is the acceleration.
Another example is radioactive decay. The rate of disintegration of a radioactive
substance in time is proportional to the quantity of matter:

\[ \frac{dy}{dt} = -ky, \]   (11.2)

in which $y = y(t)$ is the mass of the element and $k > 0$ the decay constant. The
above relations are instances of differential equations.

The present chapter aims at introducing the reader to some types of differential
equations. Although we cannot afford to go into the general theory, we will present
the basic notions and explain a few techniques for solving certain classes of
differential equations (of first and second order) that we judge particularly significant.


11.1 General definitions 
By an ordinary differential equation, abbreviated ODE, one understands a
relation among an independent real variable, say x, an unknown function $y = y(x)$
and its derivatives $y^{(k)}$ up to a specified order n. It is indicated by

\[ \mathcal{F}(x, y, y', \ldots, y^{(n)}) = 0, \]   (11.3)

where $\mathcal{F}$ is a real map depending on $n + 2$ real variables. The differential equation
has order n, if n is the highest order of differentiation in (11.3). A solution (in


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_11, 
© Springer International Publishing Switzerland 2015 




the classical sense) of the ODE over a real interval I is a function $y : I \to \mathbb{R}$,
differentiable n times on I, such that

\[ \mathcal{F}\big( x, y(x), y'(x), \ldots, y^{(n)}(x) \big) = 0 \qquad \text{for all } x \in I. \]

It happens many times that the highest derivative $y^{(n)}$ in (11.3) can be
expressed in terms of x and the remaining derivatives explicitly,

\[ y^{(n)} = f(x, y, \ldots, y^{(n-1)}), \]   (11.4)


with f a real function of n+ 1 variables (in several concrete cases this is precisely 
the form in which the equation crops up). If so, the differential equation is written 
in normal form. It should also be clear what the term ‘solution of an ordinary 
differential equation in normal form’ means. 

A differential equation is said to be autonomous if $\mathcal{F}$ (or f) does not depend on
the variable x. Equations (11.1), (11.2) are autonomous differential equations in
normal form, of order two and one respectively.

The rest of the chapter is committed to first order differential equations in 
normal form, together with a particularly important class of equations of the 
second order. 


11.2 First order differential equations 


Let f be a real-valued map defined on a subset of $\mathbb{R}^2$. A solution to the equation

\[ y' = f(x, y) \]   (11.5)

over an interval I of $\mathbb{R}$ is a differentiable map $y = y(x)$ such that $y'(x) = f(x, y(x))$
for any $x \in I$. The graph of a solution to (11.5) is called integral curve of the
differential equation.

Relation (11.5) admits a significant geometric interpretation. For each point 
(x,y) in the domain of f, f(x,y) is the slope of the tangent to the integral curve 
containing (x,y) — assuming the curve exists in the first place — so equation (11.5) 
is fittingly represented by a field of directions in the plane (see Fig. 11.1). 


Remark 11.1 If we start to move from $(x, y) = (x_0, y_0)$ along the straight line
with slope $f(x_0, y_0)$ (the tangent), we reach a point $(x_1, y_1)$ in the proximity of
the integral curve passing through $(x_0, y_0)$. From there we can advance a little bit
farther along the next tangent, reach $(x_2, y_2)$ near the curve and so on,
progressively building a polygonal path close to the integral curve issuing from $(x_0, y_0)$.
This is the so-called explicit Euler method, which is the simplest numerical procedure
for approximating the solution of a differential equation when no analytical
tools are available. This and other techniques are the content of the lecture course
on Numerical Analysis.
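
The polygonal construction of Remark 11.1 can be sketched in a few lines. The code below is a minimal illustration with a uniform step; the function name `euler` and the test problem y' = y, y(0) = 1 on [0, 1] (whose solution is e^x) are assumptions made for the example.

```python
import math

def euler(f, x0, y0, x_end, n):
    """Explicit Euler: starting from (x0, y0), follow the tangent of slope
    f(x, y) over n equal steps and return the vertices of the polygonal path."""
    h = (x_end - x0) / n
    x, y = x0, y0
    path = [(x, y)]
    for _ in range(n):
        y += h * f(x, y)   # advance along the tangent line
        x += h
        path.append((x, y))
    return path

# hypothetical test problem: y' = y with y(0) = 1
for n in (10, 100, 1000):
    x_last, y_last = euler(lambda x, y: y, 0.0, 1.0, 1.0, n)[-1]
    print(n, y_last, abs(y_last - math.e))
```

The error at x = 1 shrinks roughly in proportion to the step size, which is the expected behaviour of a first-order method.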


Figure 11.1. Field of directions representing $y' = (1 + x)y + x^2$


Solving (11.5) generalises the problem of finding the primitives of a given map.
If f depends on x and not on y, (11.5) reads

\[ y' = f(x); \]   (11.6)

assuming f continuous on I, the solutions are precisely the primitives
$y(x) = F(x) + C$ of f over I, with F a particular primitive and C an arbitrary constant.
This shows that, at least in the case where f does not depend upon y, (11.5) admits
infinitely many distinct solutions, which depend on one constant. Note that any
chosen integral curve is the vertical translate of another.

Actually, equation (11.6) plays a fundamental role, because in several circumstances,
suitable manipulations show that solving (11.5) boils down to the quest for
primitives of known functions. Furthermore, under fairly general hypotheses one
can prove that (11.5) always admits a one-parameter family of distinct solutions,
depending on an arbitrary constant of integration C. We shall write solutions in
the form

\[ y = y(x; C) \]   (11.7)

with C varying in (an interval of) $\mathbb{R}$. An expression like (11.7) is the general
integral of equation (11.5), while any solution corresponding to a particular choice
of C shall be a particular integral.


Example 11.2
Solving the differential equation
$$ y' = y \tag{11.8} $$
amounts to locating the maps that coincide with their first derivative. We have
already remarked that the exponential $y(x) = e^x$ enjoys this important property.


Figure 11.2. Integral curves of $y' = y$


Since differentiating is a linear operation, any function $y(x) = Ce^x$, $C \in \mathbb{R}$,
possesses this feature. Further on we will prove there are no other maps doing
the same, so we can conclude that the solutions to (11.8) belong to the family

$$ y(x; C) = Ce^x, \qquad C \in \mathbb{R}. $$

The integral curves are drawn in Fig. 11.2.


In order to get hold of a particular integral of (11.5), one should tell how to
select one value of the constant of integration. A customary way to do so is to
ask that the solution assume a specific value at a point $x_0$ fixed in advance. More
explicitly, we impose $y(x_0; C) = y_0$, where $x_0$ and $y_0$ are given, corresponding to
the geometric constraint that the integral curve passes through $(x_0, y_0)$. Essentially,
this amounts to solving a so-called initial value problem. More precisely, an initial
value problem, or a Cauchy problem, for (11.5) on the interval $I$ consists in
determining a differentiable function $y = y(x)$ such that

$$ \begin{cases} y' = f(x, y) & \text{on } I, \\ y(x_0) = y_0 \end{cases} \tag{11.9} $$

with given points $x_0 \in I$, $y_0 \in \mathbb{R}$. The implicit reference to time in the words
'initial value' is due to the fact that many instances of (11.9) model the evolution
of a physical system, which is in the state $y_0$ at the time $x_0$ at which the simulation
starts.


Example 11.3
The initial value problem
$$ \begin{cases} y' = y & \text{on } I = [0,+\infty), \\ y(0) = 2, \end{cases} $$
is solved by the function $y(x) = 2e^x$.


Remark 11.4 The prescription of an initial condition, albeit rather common, is
not the sole possibility to pin down a particular solution of a differential equation.
Take the following problem as an example: find the solution of $y' = y$ having mean
value 1 on the interval $I = [0,2]$. The general solution $y = Ce^x$ has to satisfy the
constraint

$$ \frac{1}{2}\int_0^2 y(x)\,dx = 1, $$

that is, $\frac{C}{2}(e^2 - 1) = 1$, which easily yields $C = \dfrac{2}{e^2 - 1}$.
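As a quick numerical sanity check of the remark, the following sketch evaluates the mean value of $Ce^x$ on $[0,2]$ with the constant found above. The helper `mean_value` and the midpoint rule are illustrative choices, not part of the text.

```python
import math

# Mean value of y over [0, 2] is (1/2) * integral_0^2 y(x) dx.
C = 2.0 / (math.e**2 - 1.0)          # constant obtained in Remark 11.4

def mean_value(y, a, b, n=100_000):
    """Approximate the mean of y on [a, b] with the composite midpoint rule."""
    h = (b - a) / n
    return sum(y(a + (k + 0.5) * h) for k in range(n)) * h / (b - a)

mean = mean_value(lambda x: C * math.exp(x), 0.0, 2.0)
print(f"C = {C:.6f}, mean value on [0, 2] = {mean:.6f}")
```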


Remark 11.5 Let us return to equations of order $n$ for the moment. With the
proper assumptions, the general integral of such an equation depends upon $n$ real,
arbitrary constants of integration $C_k$ ($k = 1, 2, \dots, n$):

$$ y = y(x; C_1, C_2, \dots, C_n). $$

The initial value problem supplies the values of $y$ and of its first $n-1$ derivatives at a
given $x_0 \in I$:

$$ \begin{cases} y^{(n)} = f(x, y, y', \dots, y^{(n-1)}) & \text{on } I, \\ y(x_0) = y_{00}, \\ y'(x_0) = y_{01}, \\ \quad\vdots \\ y^{(n-1)}(x_0) = y_{0,n-1}, \end{cases} $$

where $y_{00}, y_{01}, \dots, y_{0,n-1}$ are $n$ fixed real numbers. For instance, the trajectory of
the particle described by equation (11.1) is uniquely determined by the initial
position $s(0)$ and initial velocity $s'(0)$.

Besides initial value problems, a particular solution to a higher order equation
can be found by assigning values to the solution (and/or some derivatives) at the
end-points of the interval. In this case one speaks of a boundary value problem.
For instance, the second order problem

$$ \begin{cases} y'' = k \sin y & \text{on the interval } (a, b), \\ y(a) = 0, \quad y(b) = 0, \end{cases} $$


models the sag from the rest position of a thin elastic beam subject to a small 
load acting in the direction of the z-axis. 


We focus now on three special kinds of first order differential equations, which
can be solved by finding a few primitive functions.


11.2.1 Equations with separable variables 


The variables are said to be "separable" in differential equations of type

$$ y' = g(x)\,h(y), \tag{11.10} $$

where the $f(x,y)$ of (11.5) is the product of a continuous function $g$ depending only on $x$
and a continuous function $h$ of $y$ alone.

If $\bar y \in \mathbb{R}$ annihilates $h$, i.e., $h(\bar y) = 0$, the constant map $y(x) = \bar y$ is a particular
integral of (11.10), for the equation becomes $0 = 0$. Therefore an equation with
separable variables has, to start with, as many particular solutions $y(x) = \text{constant}$
as the number of distinct zeroes of $h$. These are called singular integrals of the
differential equation.

On each interval $J$ where $h(y)$ does not vanish we can write (11.10) as

$$ \frac{1}{h(y)}\,\frac{dy}{dx} = g(x). $$

Let $H(y)$ be a primitive of $\frac{1}{h(y)}$ (with respect to $y$). By the Chain rule (Theorem 6.7)

$$ \frac{d}{dx} H(y(x)) = \frac{dH}{dy}\,\frac{dy}{dx} = \frac{1}{h(y)}\,\frac{dy}{dx} = g(x), $$

so $H(y(x))$ is a primitive function of $g(x)$. Therefore, given an arbitrary primitive
$G(x)$ of $g(x)$, we have

$$ H(y(x)) = G(x) + C, \qquad C \in \mathbb{R}. \tag{11.11} $$

But we assumed $h$ has no zeroes on $J$, so $\frac{dH}{dy} = \frac{1}{h(y)}$ is continuous and never zero there,
hence it must have constant sign. This implies that $H(y)$ is strictly monotone on $J$, i.e.,
invertible by Theorem 2.8. We are then allowed to make $y(x)$ explicit in (11.11):

$$ y(x) = H^{-1}(G(x) + C), \tag{11.12} $$

where $H^{-1}$ is the inverse of $H$. This expression is the general integral of equation
(11.10) over every interval where $h(y(x))$ is never zero. But, should we not be
able to attain the analytic expression of $H^{-1}$, formula (11.12) would be of
merely theoretical interest. In such an event one is entitled to stop at the implicit
form (11.11).

If equation (11.10) has singular solutions, these might admit the form (11.12)
for special values of $C$. Sometimes, taking the limit for $C \to \pm\infty$ in (11.12)
furnishes singular integrals.


Formula (11.11) is best remembered by interpreting the derivative $\frac{dy}{dx}$ as a
formal ratio, following Leibniz. Namely, dividing (11.10) by $h(y)$ and 'multiplying'
by $dx$ gives

$$ \frac{dy}{h(y)} = g(x)\,dx, $$

which can then be integrated:

$$ \int \frac{dy}{h(y)} = \int g(x)\,dx. $$

This corresponds exactly to (11.11). The reader must not forget, though, that the
correct proof of the formula is the one shown earlier!


Examples 11.6

i) Solve the differential equation $y' = y(1-y)$. Let us put $g(x) = 1$ and
$h(y) = y(1-y)$. The zeroes of $h$ produce two singular integrals $y_1(x) = 0$ and
$y_2(x) = 1$.

Suppose now $h(y)$ is not 0. We write the equation as

$$ \frac{dy}{y(1-y)} = dx, $$

then integrate with respect to $y$ on the left, and on the right with respect to $x$:

$$ \log\left|\frac{y}{1-y}\right| = x + C. $$

Exponentiating, we obtain

$$ \left|\frac{y}{1-y}\right| = e^{x+C} = k e^x, $$

where $k = e^C$ is an arbitrary positive constant. Therefore

$$ \frac{y}{1-y} = \pm k e^x = K e^x, $$

$K$ being any non-zero constant. Writing $y$ as a function of $x$, we get

$$ y(x) = \frac{K e^x}{1 + K e^x}. $$

Note the singular solution $y_1(x) = 0$ belongs to the above family for $K = 0$, a
value $K$ was originally prevented from taking. The other singular integral, $y_2(x) = 1$,
formally arises by letting $K$ go to infinity.


ii) Consider the equation

$$ y' = \sqrt{y}. $$

At first glance we spot the singular solution $y_1(x) = 0$. That apart, by separating
variables we have

$$ \int \frac{dy}{\sqrt{y}} = \int dx, \qquad\text{hence}\qquad 2\sqrt{y} = x + C, $$

and so

$$ y(x) = \left(\frac{x}{2} + C\right)^2, \qquad C \in \mathbb{R} $$

(where $C/2$ has become $C$).
iii) Solve

$$ y' = \frac{e^x + 1}{e^y + 1}. $$

Let $g(x) = e^x + 1$ and $h(y) = \frac{1}{e^y + 1} > 0$ for any $y$; there are no singular integrals.
The separation of $x$ and $y$ yields

$$ \int (e^y + 1)\,dy = \int (e^x + 1)\,dx, $$

$$ e^y + y = e^x + x + C, \qquad C \in \mathbb{R}. $$

But now we are stuck, for it is not possible to write $y$ explicitly as a function of
the variable $x$.
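Even though $y$ cannot be written in closed form here, the implicit relation still determines $y(x)$ numerically. The sketch below, an illustration rather than part of the text, solves $e^y + y = e^x + x + C$ for $y$ with Newton's method and compares the slope of the resulting curve with the right-hand side of the equation. The function name `solve_implicit` and the sample values $C = 1$, $x = 0.5$ are assumptions made for the example.

```python
import math

def solve_implicit(x, C, tol=1e-12, max_iter=100):
    """Solve e^y + y = e^x + x + C for y by Newton's method."""
    rhs = math.exp(x) + x + C
    y = x                                   # starting guess (exact when C = 0)
    for _ in range(max_iter):
        F = math.exp(y) + y - rhs           # residual of the implicit relation
        step = F / (math.exp(y) + 1.0)      # F'(y) = e^y + 1 > 0, never vanishes
        y -= step
        if abs(step) < tol:
            break
    return y

# Illustrative data: C = 1, evaluated at x = 0.5.
x, C, h = 0.5, 1.0, 1e-5
y = solve_implicit(x, C)
slope_fd = (solve_implicit(x + h, C) - solve_implicit(x - h, C)) / (2 * h)
slope_ode = (math.exp(x) + 1.0) / (math.exp(y) + 1.0)
print(f"y({x}) = {y:.6f}, slope by differences = {slope_fd:.6f}, ODE right-hand side = {slope_ode:.6f}")
```

Newton's method is a natural choice because the left-hand side $e^y + y$ is strictly increasing and convex in $y$, so the root is unique.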


11.2.2 Linear equations 


A differential equation of the form

$$ y' + a(x)\,y = b(x), \tag{11.13} $$

where $a$ and $b$ are continuous on $I$, is called linear, because the function $f(x,y) =
-a(x)y + b(x)$ is a linear polynomial in $y$ with coefficients depending on the variable $x$.
This equation is said to be homogeneous if the source term vanishes, $b(x) = 0$,
non-homogeneous otherwise.

We begin by solving the homogeneous case

$$ y' = -a(x)\,y. \tag{11.14} $$

This is a particular example of an equation with separable variables. Referring to
(11.10), we have $g(x) = -a(x)$ and $h(y) = y$. The constant $y(x) = 0$ is a solution.
Excluding this possibility, we can write

$$ \int \frac{1}{y}\,dy = -\int a(x)\,dx. $$

If $A(x)$ denotes a primitive of $a(x)$, i.e., if

$$ \int a(x)\,dx = A(x) + C, \qquad C \in \mathbb{R}, \tag{11.15} $$

then

$$ \log|y| = -A(x) - C, $$

or, equivalently,

$$ |y(x)| = e^{-C} e^{-A(x)}, \qquad\text{hence}\qquad y(x) = \pm K e^{-A(x)}, $$

where $K = e^{-C} > 0$. The particular integral $y(x) = 0$ is included if we allow $K$ to
become 0. The solutions of the homogeneous linear equation (11.14) are

$$ y(x) = K e^{-A(x)}, \qquad K \in \mathbb{R}, $$

with $A(x)$ defined by (11.15).
Now let us assess the case $b \ne 0$. We make use of the method of variation of
parameters, which consists in searching for solutions of the form

$$ y(x) = K(x)\,e^{-A(x)}, $$

where $K(x)$, a function of $x$, is unknown. Such a representation for $y(x)$ always
exists, since $e^{-A(x)} > 0$. Substituting in (11.13), we obtain

$$ K'(x)e^{-A(x)} + K(x)e^{-A(x)}\bigl(-a(x)\bigr) + a(x)K(x)e^{-A(x)} = b(x), $$

or

$$ K'(x) = e^{A(x)}\,b(x). $$

Calling $B(x)$ a primitive of $e^{A(x)}b(x)$,

$$ \int e^{A(x)}\,b(x)\,dx = B(x) + C, \qquad C \in \mathbb{R}, \tag{11.16} $$

we have

$$ K(x) = B(x) + C, $$

so the general solution to (11.13) reads

$$ y(x) = e^{-A(x)}\bigl(B(x) + C\bigr), \tag{11.17} $$

where $A(x)$ and $B(x)$ are defined by (11.15) and (11.16). The integral is more
often than not found in the form

$$ y(x) = e^{-\int a(x)\,dx} \int e^{\int a(x)\,dx}\, b(x)\,dx. \tag{11.18} $$

The expression highlights the various steps involved in the solution of a
non-homogeneous linear equation: one has to integrate twice, in succession.
If we are asked to solve the initial value problem

$$ \begin{cases} y' + a(x)\,y = b(x) & \text{on the interval } I, \\ y(x_0) = y_0, & \text{with } x_0 \in I \text{ and } y_0 \in \mathbb{R}, \end{cases} \tag{11.19} $$


we might want to choose, as primitive of $a(x)$, the one vanishing at $x_0$, which we
write $A(x) = \int_{x_0}^{x} a(s)\,ds$ by the Fundamental Theorem of integral calculus; we do
the same for

$$ B(x) = \int_{x_0}^{x} e^{\int_{x_0}^{t} a(s)\,ds}\, b(t)\,dt $$

(recall that the variables in a definite integral are arbitrary symbols). Substituting
these expressions in (11.17) we obtain $y(x_0) = C$, hence the solution to (11.19)
will satisfy $C = y_0$, namely

$$ y(x) = e^{-\int_{x_0}^{x} a(s)\,ds}\left( y_0 + \int_{x_0}^{x} e^{\int_{x_0}^{t} a(s)\,ds}\, b(t)\,dt \right). \tag{11.20} $$
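Formula (11.20) involves two nested integrations, which can be carried out numerically when no primitives are available. The following is a minimal sketch under stated assumptions: the names `integrate` and `solve_linear_ivp`, the use of Simpson's rule, and the constant-coefficient test data are illustrative choices, not prescribed by the text.

```python
import math

def integrate(func, lo, hi, n=400):
    """Composite Simpson rule on [lo, hi] with n (even) subintervals."""
    h = (hi - lo) / n
    total = func(lo) + func(hi)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * func(lo + k * h)
    return total * h / 3.0

def solve_linear_ivp(a, b, x0, y0, x):
    """Evaluate formula (11.20) for y' + a(x) y = b(x), y(x0) = y0."""
    A = lambda t: integrate(a, x0, t)                 # primitive of a vanishing at x0
    B = integrate(lambda t: math.exp(A(t)) * b(t), x0, x)
    return math.exp(-A(x)) * (y0 + B)

# Illustrative check with constant coefficients a = 2, b = 3 and y(1) = 5,
# where the closed form is y(x) = (y0 - b/a) e^{-a(x - x0)} + b/a.
a_const, b_const, x0, y0, x = 2.0, 3.0, 1.0, 5.0, 2.0
numeric = solve_linear_ivp(lambda t: a_const, lambda t: b_const, x0, y0, x)
exact = (y0 - b_const / a_const) * math.exp(-a_const * (x - x0)) + b_const / a_const
print(f"formula (11.20): {numeric:.8f}, closed form: {exact:.8f}")
```

The inner integral is recomputed at every node of the outer one, which is wasteful but keeps the correspondence with (11.20) transparent.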


Examples 11.7

i) Determine the general integral of the linear equation

$$ y' + ay = b, $$

where $a \ne 0$ and $b$ are real numbers. By choosing $A(x) = ax$, $B(x) = \frac{b}{a}e^{ax}$ we
find the general solution

$$ y(x) = Ce^{-ax} + \frac{b}{a}. $$

If $a = -1$, $b = 0$, the formula provides the announced result that every solution
of $y' = y$ has the form $y(x) = Ce^x$.
For the initial value problem

$$ \begin{cases} y' + ay = b & \text{on } [1,+\infty), \\ y(1) = y_0, \end{cases} $$

it is convenient to have $A(x) = a(x-1)$, $B(x) = \frac{b}{a}\bigl(e^{a(x-1)} - 1\bigr)$, so that

$$ y(x) = \left(y_0 - \frac{b}{a}\right)e^{-a(x-1)} + \frac{b}{a}. $$

Note that if $a > 0$ the solution converges to $\frac{b}{a}$ for $x \to +\infty$ (independently of the
initial datum $y_0$).
ii) Determine the integral curves of

$$ xy' + y = x^2 $$

that lie in the first quadrant of the $(x, y)$-plane. Written as (11.13), the equation is

$$ y' + \frac{1}{x}\,y = x, $$

so $a(x) = \frac{1}{x}$, $b(x) = x$. With $A(x) = \log x$ we have $e^{A(x)} = x$ and $e^{-A(x)} = \frac{1}{x}$.
Consequently,

$$ \int e^{A(x)}\,b(x)\,dx = \int x^2\,dx = \frac{1}{3}x^3 + C. $$

Therefore, when $x > 0$ the general integral is

$$ y(x) = \frac{1}{x}\left(\frac{1}{3}x^3 + C\right) = \frac{x^2}{3} + \frac{C}{x}. $$

For $C \ge 0$, $y(x) > 0$ for any $x > 0$, whereas $C < 0$ implies $y(x) > 0$ for
$x > \sqrt[3]{3|C|}$.


11.2.3 Homogeneous equations 


Homogeneity refers to the form

$$ y' = \varphi\!\left(\frac{y}{x}\right), \tag{11.21} $$

in which $\varphi = \varphi(z)$ is continuous in the variable $z$. Thus, $f(x,y)$ depends on $x$, $y$
only in terms of their ratio $\frac{y}{x}$; we can equivalently say that $f(\lambda x, \lambda y) = f(x, y)$ for
any $\lambda > 0$.

A homogeneous equation can be solved by separation of variables, in that one
puts $z = \frac{y}{x}$, to be understood as $z(x) = \frac{y(x)}{x}$. In this manner $y(x) = xz(x)$ and
$y'(x) = z(x) + xz'(x)$. Substituting in (11.21) yields

$$ z' = \frac{\varphi(z) - z}{x}, $$

an equation in $z$ where the variables are separated. We can apply the strategy of
Sect. 11.2.1. Every solution $\bar z$ of $\varphi(z) = z$ gives rise to a singular integral $z(x) = \bar z$,
i.e., $y(x) = \bar z x$. Supposing instead $\varphi(z)$ different from $z$, we have

$$ \int \frac{dz}{\varphi(z) - z} = \int \frac{dx}{x}, $$

giving

$$ H(z) = \log|x| + C, $$

where $H(z)$ is a primitive of $\frac{1}{\varphi(z) - z}$. Indicating by $H^{-1}$ the inverse map, we
have

$$ z(x) = H^{-1}(\log|x| + C), $$

so the general integral of (11.21) reads (returning to $y$)

$$ y(x) = x\,H^{-1}(\log|x| + C). $$


Example 11.8
Solve
$$ x^2 y' = y^2 + xy + x^2. \tag{11.22} $$
We can put the equation in normal form
$$ y' = \left(\frac{y}{x}\right)^2 + \frac{y}{x} + 1, $$
which is homogeneous for $\varphi(z) = z^2 + z + 1$. Substituting $y = xz$, we arrive at
$$ z' = \frac{z^2 + 1}{x}, $$
whose variables are separated.
As $z^2 + 1$ is positive, there are no singular solutions. Integrating we obtain
$$ \arctan z = \log|x| + C $$
and the general solution to (11.22) is
$$ y(x) = x\tan(\log|x| + C). $$
We remark that $C$ can be chosen either in $(-\infty, 0)$ or in $(0, +\infty)$, because of the
singularity at $x = 0$. Moreover, the domain of existence of each solution depends
on the value of $C$.
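The general solution of Example 11.8 can be checked numerically by inserting it into (11.22). The sketch below is illustrative only: the sample points and the central-difference approximation of $y'$ are choices made here.

```python
import math

def y(x, C):
    """General solution of Example 11.8: y(x) = x tan(log|x| + C)."""
    return x * math.tan(math.log(abs(x)) + C)

def residual(x, C, h=1e-6):
    """x^2 y' - (y^2 + x y + x^2), with y' from a central difference."""
    dy = (y(x + h, C) - y(x - h, C)) / (2 * h)
    return x**2 * dy - (y(x, C)**2 + x * y(x, C) + x**2)

# Illustrative sample points, chosen away from the poles of the tangent.
samples = [(1.0, 0.3), (2.0, -0.5), (-1.5, 0.2)]
residuals = [residual(x, C) for x, C in samples]
print(residuals)
```

The residuals vanish up to discretisation and rounding error, on both sides of the singularity $x = 0$.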


11.2.4 Second order equations reducible to first order 


Suppose an equation of second order does not contain the variable $y$ explicitly,
that is,

$$ y'' = f(y', x). \tag{11.23} $$

Then the substitution $z = y'$ transforms it into a first order equation

$$ z' = f(z, x) $$

in the unknown $z = z(x)$. If the latter has general solution $z(x; C_1)$, we can recover
the integrals of (11.23) by solving

$$ y' = z(x; C_1), $$

hence by finding the primitives of $z(x; C_1)$. This will generate a new constant of
integration $C_2$. The general solution to (11.23) will have the form

$$ y(x; C_1, C_2) = \int z(x; C_1)\,dx = Z(x; C_1) + C_2, $$

where $Z(x; C_1)$ is a particular primitive of $z(x; C_1)$.


Example 11.9

Solve
$$ y'' - (y')^2 = 1. $$
Put $z = y'$ so that the equation becomes
$$ z' = z^2 + 1. $$
The variables are separated and the integral is $\arctan z = x + C_1$, i.e.,
$$ z(x; C_1) = \tan(x + C_1). $$
Integrating once again,
$$ y(x; C_1, C_2) = \int \tan(x + C_1)\,dx = \int \frac{\sin(x + C_1)}{\cos(x + C_1)}\,dx = -\log\bigl(\cos(x + C_1)\bigr) + C_2, \qquad C_1, C_2 \in \mathbb{R}. $$


11.3 Initial value problems for equations of the first order 


Hitherto we have surveyed families of differential equations of the first order, and
shown ways to express the general solution in terms of indefinite integrals of known
functions. These examples do not exhaust the class of equations which can be
solved analytically, and various other devices have been developed to furnish exact
solutions to equations with particularly interesting applications. That said,
analytical tools are not available for every conceivable equation, and even when they are,
they might be impractical. In these cases it is necessary to adopt approximations,
often numerical ones. Most of the time one can only hope to approximate
an integral stemming, for instance, from an initial value problem. The use of such
techniques must in any case follow a qualitative investigation of the ODE, to make
sure at least that a solution exists. A qualitative study of this kind has its own
interest, regardless of subsequent approximations, for it allows us to understand in
which way the solution of an initial value problem depends upon the initial datum,
among other things.

Let us analyse the problem (11.9) and talk about a simple constraint on $f$ that
has a series of consequences: in the first place it guarantees that the problem admits
a solution in a neighbourhood of $x_0$; secondly, that such a solution is unique; and
thirdly, that the latter depends on $y_0$ with continuity. Should all this happen, we
say that the initial value problem (11.9) is well posed (in the sense of Hadamard).


11.3.1 Lipschitz functions 


Before getting going, we present a remarkable way in which functions can depend 
on their variables. 


Definition 11.10 A real-valued map of one real variable $f : J \to \mathbb{R}$, $J$ an
interval, is said to be Lipschitz continuous on $J$ if there exists a constant $L \ge 0$
such that

$$ |f(y_1) - f(y_2)| \le L\,|y_1 - y_2|, \qquad \forall y_1, y_2 \in J. \tag{11.24} $$

Another way to write the same is

$$ \frac{|f(y_1) - f(y_2)|}{|y_1 - y_2|} \le L, \qquad \forall y_1, y_2 \in J,\ y_1 \ne y_2, \tag{11.25} $$

which means the difference quotient of $f$ is bounded as $y_1 \ne y_2$ vary in $J$.

If (11.24) holds for a certain constant $L$, it is valid for bigger numbers too.
The smallest constant fulfilling (11.24) is called the Lipschitz constant of $f$ on $J$.
The Lipschitz constant is nothing else but the supremum of the left-hand side
of (11.25), when the variables vary in $J$. This number is often hard to
determine, but normally an upper bound suffices.

A Lipschitz-continuous map on $J$ is necessarily continuous everywhere on $J$
(actually, it is uniformly continuous on $J$, according to the definition given in
Appendix A.3.3, p. 447), for condition (3.6) works with $\delta = \varepsilon/L$. Continuous
maps that fail (11.25) do exist nevertheless, like $f(y) = \sqrt{y}$ over $J = [0,+\infty)$;
choosing $y_2 = 0$ we have

$$ \frac{|f(y_1) - f(y_2)|}{|y_1 - y_2|} = \frac{\sqrt{y_1}}{y_1} = \frac{1}{\sqrt{y_1}}, \qquad \forall y_1 > 0, $$

and in the limit for $y_1 \to 0$ the ratio on the left exceeds any constant. Note that
the function has infinite (right) derivative at $y = 0$.
The forthcoming result is the quickest to apply among those testing Lipschitz
continuity.


Proposition 11.11 Let $f : J \to \mathbb{R}$ be differentiable on $J$ with bounded
derivative, and $L = \sup_{y \in J} |f'(y)| < +\infty$. Then $f$ is Lipschitz continuous on $J$
with Lipschitz constant $L$.

Proof. For (11.24) it is enough to apply the second formula of the finite increment
(6.13) to $f$ between $y_1$, $y_2$, so that

$$ f(y_1) - f(y_2) = f'(\bar y)(y_1 - y_2) $$

for some $\bar y$ between $y_1$ and $y_2$. Therefore

$$ |f(y_1) - f(y_2)| = |f'(\bar y)|\,|y_1 - y_2| \le L\,|y_1 - y_2|. $$

This proves the Lipschitz constant $L^*$ of $f$ is $\le L$.

Vice versa, take any $y_0 \in J$. By (11.25)

$$ |f'(y_0)| = \left| \lim_{y \to y_0} \frac{f(y) - f(y_0)}{y - y_0} \right| = \lim_{y \to y_0} \frac{|f(y) - f(y_0)|}{|y - y_0|} \le L^*, $$

and then $L \le L^*$.


Let us see some examples of Lipschitz-continuous maps.

Examples 11.12

i) The function $f(y) = \sqrt{y}$ is Lipschitz continuous on every interval $[a, +\infty)$ with
$a > 0$, because

$$ 0 < f'(y) = \frac{1}{2\sqrt{y}} \le \frac{1}{2\sqrt{a}} $$

on said intervals; the Lipschitz constant is $L = \frac{1}{2\sqrt{a}}$.

ii) The trigonometric maps $f(y) = \sin y$, $f(y) = \cos y$ are Lipschitz continuous
on the whole $\mathbb{R}$ with $L = 1$, since $|f'(y)| \le 1$, $\forall y \in \mathbb{R}$, and there exist $y \in \mathbb{R}$ at
which $|f'(y)| = 1$.

iii) The exponential $f(y) = e^y$ is Lipschitz continuous on all intervals $(-\infty, b]$,
$b \in \mathbb{R}$, with constant $L = e^b$; it is not globally Lipschitz continuous, for
$\sup_{y \in \mathbb{R}} f'(y) = +\infty$.
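The difference quotient in (11.25) can be explored numerically. The following sketch is only a grid-based illustration, not a proof: the helper `max_difference_quotient`, the intervals and the grid sizes are choices made here. It samples the quotient for $\sqrt{y}$ and $\sin y$ and shows how, for $\sqrt{y}$ on an interval containing 0, the sampled maximum keeps growing as the grid is refined.

```python
import math

def max_difference_quotient(f, lo, hi, n=400):
    """Largest |f(y1) - f(y2)| / |y1 - y2| over a uniform grid on [lo, hi]."""
    pts = [lo + (hi - lo) * k / n for k in range(n + 1)]
    return max(abs(f(p) - f(q)) / abs(p - q)
               for i, p in enumerate(pts) for q in pts[:i])

# sqrt on [a, 4] with a = 1/4: Proposition 11.11 gives L = 1/(2 sqrt(a)) = 1.
q_sqrt = max_difference_quotient(math.sqrt, 0.25, 4.0)
# sin on [-5, 5]: the Lipschitz constant is 1.
q_sin = max_difference_quotient(math.sin, -5.0, 5.0)
# sqrt on [0, 1]: the quotient grows as the grid is refined, so no L works.
q_coarse = max_difference_quotient(math.sqrt, 0.0, 1.0, n=100)
q_fine = max_difference_quotient(math.sqrt, 0.0, 1.0, n=400)
print(q_sqrt, q_sin, q_coarse, q_fine)
```

Sampled quotients only bound the Lipschitz constant from below; an upper bound still requires an argument such as Proposition 11.11.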


Proposition 11.11 gives a sufficient condition for Lipschitz continuity. A function
can in fact be Lipschitz continuous on an interval without being differentiable:
$f(y) = |y|$ is not differentiable at the origin, yet has Lipschitz constant 1 everywhere
on $\mathbb{R}$, because

$$ \bigl||y_1| - |y_2|\bigr| \le |y_1 - y_2|, \qquad \forall y_1, y_2 \in \mathbb{R}. $$

Now to several variables. A function $f : \Omega \subseteq \mathbb{R}^d \to \mathbb{R}$ is Lipschitz continuous
on $\Omega$ if there is a constant $L \ge 0$ such that

$$ |f(y_1) - f(y_2)| \le L\,\|y_1 - y_2\|, \qquad \forall y_1, y_2 \in \Omega. $$

We say a map $f : I \times J \subseteq \mathbb{R}^2 \to \mathbb{R}$, with $I$, $J$ real intervals, is Lipschitz
continuous on $\Omega = I \times J$ in $y$, uniformly in $x$, if there is a constant $L \ge 0$
such that

$$ |f(x, y_1) - f(x, y_2)| \le L\,|y_1 - y_2|, \qquad \forall y_1, y_2 \in J,\ \forall x \in I. \tag{11.26} $$

This condition holds if $f$ has bounded partial $y$-derivative on $\Omega$, i.e.,
$L = \sup_{(x,y) \in \Omega} \left|\frac{\partial f}{\partial y}(x, y)\right| < +\infty$, because Proposition 11.11 can be applied for every $x \in I$.
7] 


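Example 11.13
Consider
$$ f(x, y) = \sqrt[3]{x}\,\sin(x + y) $$
on $\Omega = [-8, 8] \times \mathbb{R}$. Since
$$ \frac{\partial f}{\partial y}(x, y) = \sqrt[3]{x}\,\cos(x + y), $$
for any $(x, y) \in \Omega$
$$ \left|\frac{\partial f}{\partial y}(x, y)\right| = |\sqrt[3]{x}|\,|\cos(x + y)| \le \sqrt[3]{8} \cdot 1 = 2. $$
Thus (11.26) holds with $L = 2$.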


11.3.2 A criterion for solving initial value problems 


After the intermezzo on Lipschitz-continuous functions, we are ready to state the 
main result concerning the initial value problem (11.9). 


Theorem 11.14 Let $I$, $J$ be non-empty real intervals, $J$ additionally open.
Suppose $f : \Omega = I \times J \subseteq \mathbb{R}^2 \to \mathbb{R}$ is continuous on $\Omega$ and Lipschitz continuous
on $\Omega$ in $y$, uniformly in $x$.

For any $(x_0, y_0) \in \Omega$, the initial value problem (11.9) admits one, and only
one, solution $y = y(x)$, defined and differentiable with continuity on an interval
$I' \subseteq I$ containing $x_0$ and larger than a single point, such that $(x, y(x)) \in \Omega$
for any $x \in I'$.

If $\tilde y = \tilde y(x)$ denotes the solution on an interval $I'' \subseteq I$ to the problem with
initial value $(x_0, \tilde y_0) \in \Omega$, then

$$ |y(x) - \tilde y(x)| \le e^{L|x - x_0|}\,|y_0 - \tilde y_0|, \qquad \forall x \in I' \cap I'', \tag{11.27} $$

where $L$ is the constant of (11.26).


The theorem ensures existence and uniqueness of a "local" solution, a solution
defined in a neighbourhood of $x_0$. The point is, the solution might not be defined
everywhere on $I$, because the integral curve $(x, y(x))$, also known as trajectory,
could leave the region $\Omega$ before $x$ has run over the entire $I$. For example, $f(y) = y^2$
is Lipschitz continuous on every bounded interval $J_a = (-a, a)$, $a > 0$, because

$$ \sup_{y \in J_a} |f'(y)| = \sup_{|y| < a} |2y| = 2a, $$

but is not Lipschitz continuous on $\mathbb{R}$. The initial value problem

$$ \begin{cases} y' = y^2, \\ y(0) = \frac{1}{2} \end{cases} \tag{11.28} $$

has no solution over all of $I = [0, +\infty)$: separating variables we discover

$$ y(x) = \frac{1}{2 - x}, $$

showing that the trajectory $(x, y(x))$ leaves every strip $\Omega_a = I \times J_a$, $a > 1$, before
$x$ can reach 2 (see Fig. 11.3).

Figure 11.3. The solution of (11.28) is not defined on $I = [0, +\infty)$
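The escape from every strip can be made concrete with a short computation. The sketch below assumes the initial datum $y(0) = 1/2$, which is consistent with the blow-up at $x = 2$ mentioned above. The function names, the step size and the strip half-widths are illustrative choices.

```python
def exit_abscissa(a, y0=0.5):
    """Abscissa where the solution y(x) = 1/(1/y0 - x) of y' = y^2, y(0) = y0 reaches y = a."""
    return 1.0 / y0 - 1.0 / a

def euler_exit(a, y0=0.5, h=1e-4):
    """Advance explicit Euler for y' = y^2 until the polygonal path leaves the strip |y| < a."""
    x, y = 0.0, y0
    while abs(y) < a:
        y += h * y * y
        x += h
    return x

# Exact and approximate exit abscissas for a few strip half-widths a.
exits = [(a, exit_abscissa(a), euler_exit(a)) for a in (1.0, 10.0, 100.0)]
for a, exact_x, euler_x in exits:
    print(f"a = {a:6.1f}: exact exit at x = {exact_x:.4f}, Euler exit at x = {euler_x:.4f}")
```

However large the strip, the exit abscissa stays below 2, so no choice of $a$ lets the local solution extend to all of $I$.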

When the hypotheses of the theorem hold with $J = \mathbb{R}$, one can prove that the solution
exists over all of $I$.

The uniqueness of the solution to (11.9) follows immediately from (11.27): if
$y(x)$ and $\tilde y(x)$ are solutions corresponding to the same initial datum $y_0 = \tilde y_0$ at
$x_0$, then $y(x) = \tilde y(x)$ for any $x$.


Observe that if $f$ is not Lipschitz continuous in the second variable around
$(x_0, y_0)$, the initial value problem may have many solutions. The problem

$$ \begin{cases} y' = \sqrt{y}, \\ y(0) = 0 \end{cases} $$

is solvable by separation of variables, and admits the constant $y(x) = 0$ (the
singular integral), as well as $y(x) = \frac{1}{4}x^2$, as solutions. As a matter of fact there are
infinitely many solutions

$$ y(x) = \begin{cases} 0 & \text{if } 0 \le x \le c, \\ \frac{1}{4}(x - c)^2 & \text{if } x > c, \end{cases} \qquad c \ge 0, $$

obtained by 'gluing' in the right way the aforementioned integrals.
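That each glued function really solves the problem, including at the junction $x = c$, can be checked numerically. This is a minimal sketch: the names `glued_solution` and `max_residual`, the sample grid and the values of $c$ are illustrative choices.

```python
import math

def glued_solution(c):
    """Solution of y' = sqrt(y), y(0) = 0 that stays at 0 up to x = c >= 0."""
    return lambda x: 0.0 if x <= c else 0.25 * (x - c) ** 2

def max_residual(y, xs, h=1e-6):
    """Largest |y'(x) - sqrt(y(x))| on the sample points, y' by central differences."""
    return max(abs((y(x + h) - y(x - h)) / (2 * h) - math.sqrt(y(x))) for x in xs)

xs = [0.1 * k for k in range(1, 50)]            # sample points in (0, 5)
residuals = {c: max_residual(glued_solution(c), xs) for c in (0.0, 1.0, 2.5)}
print(residuals)
```

All three functions start from $y(0) = 0$ and satisfy the equation up to discretisation error, which illustrates the failure of uniqueness.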


Finally, (11.27) expresses the continuous dependence of the solution to (11.9)
upon $y_0$: an $\varepsilon$-deviation of the initial datum affects the solution at $x \ne x_0$
by at most $e^{L|x - x_0|}\varepsilon$. Otherwise said, when two solutions evolve, the distance between the
corresponding trajectories can grow at most by the factor $e^{L|x - x_0|}$ in going from
$x_0$ to $x$. In any case the factor $e^{L|x - x_0|}$ is an exponential in $x$, so its impact
depends on the distance $|x - x_0|$ and on the Lipschitz constant.
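The estimate (11.27) can be observed on the function of Example 11.13, for which $L = 2$. The sketch below compares two explicit Euler paths started from nearby data. The step size, the initial data and the perturbation `eps` are illustrative assumptions, and the Euler paths only approximate the true solutions. The check is nevertheless meaningful, because each Euler step multiplies the gap by at most $1 + hL \le e^{hL}$.

```python
import math

def cbrt(x):
    """Real cube root, valid for negative x as well."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)

def f(x, y):
    return cbrt(x) * math.sin(x + y)      # right-hand side of Example 11.13, L = 2

def euler_path(y0, x0=0.0, h=1e-3, n=2000):
    ys = [y0]
    for k in range(n):
        ys.append(ys[-1] + h * f(x0 + k * h, ys[-1]))
    return ys

L, h, eps = 2.0, 1e-3, 1e-2
ys = euler_path(1.0, h=h)
ys_tilde = euler_path(1.0 + eps, h=h)
# Compare the gap between the two paths with the bound e^{L|x - x0|} * eps of (11.27).
bound_holds = all(abs(u - v) <= math.exp(L * k * h) * eps * (1 + 1e-12)
                  for k, (u, v) in enumerate(zip(ys, ys_tilde)))
print(bound_holds, abs(ys[-1] - ys_tilde[-1]))
```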


11.4 Linear second order equations with constant 
coefficients 


A linear equation of order two with constant coefficients has the form

$$ y'' + ay' + by = g, \tag{11.29} $$

where $a$, $b$ are real constants and $g = g(x)$ is a continuous map. We shall prove
that the general integral can be computed without too big an effort in case $g = 0$,
hence when the equation is homogeneous. We will show, moreover, how to find
the explicit solutions when $g$ is a product of exponentials, algebraic polynomials,
sine- and cosine-type functions or, in general, a sum of these.

To study equation (11.29) we let the map $y = y(x)$ be complex-valued, for
convenience. The function $y : I \subseteq \mathbb{R} \to \mathbb{C}$ is ($n$ times) differentiable if $y_r =
\operatorname{Re} y : I \to \mathbb{R}$ and $y_i = \operatorname{Im} y : I \to \mathbb{R}$ are ($n$ times) differentiable, in which case
$y^{(n)}(x) = y_r^{(n)}(x) + i\,y_i^{(n)}(x)$.

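A special case of this situation goes as follows. Let $\lambda = \lambda_r + i\lambda_i \in \mathbb{C}$ be an
arbitrary complex number. With (8.39) in mind, we consider the complex-valued
map of one real variable $x \mapsto e^{\lambda x} = e^{\lambda_r x}(\cos\lambda_i x + i\sin\lambda_i x)$. Then

$$ \frac{d}{dx} e^{\lambda x} = \lambda e^{\lambda x}, \tag{11.30} $$

precisely as if $\lambda$ were real. In fact,

$$ \begin{aligned} \frac{d}{dx} e^{\lambda x} &= \frac{d}{dx}\bigl(e^{\lambda_r x}\cos\lambda_i x\bigr) + i\,\frac{d}{dx}\bigl(e^{\lambda_r x}\sin\lambda_i x\bigr) \\ &= \lambda_r e^{\lambda_r x}\cos\lambda_i x - \lambda_i e^{\lambda_r x}\sin\lambda_i x + i\bigl(\lambda_r e^{\lambda_r x}\sin\lambda_i x + \lambda_i e^{\lambda_r x}\cos\lambda_i x\bigr) \\ &= \lambda_r e^{\lambda_r x}(\cos\lambda_i x + i\sin\lambda_i x) + i\lambda_i e^{\lambda_r x}(\cos\lambda_i x + i\sin\lambda_i x) \\ &= (\lambda_r + i\lambda_i)\,e^{\lambda x} = \lambda e^{\lambda x}. \end{aligned} $$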


Let us indicate by $Ly = y'' + ay' + by$ the left-hand side of (11.29). Differentiating
is a linear operation, so

$$ L(\alpha y + \beta z) = \alpha Ly + \beta Lz \tag{11.31} $$

for any $\alpha, \beta \in \mathbb{R}$ and any twice-differentiable real functions $y = y(x)$, $z = z(x)$.
Furthermore, the result holds also for $\alpha, \beta \in \mathbb{C}$ and $y = y(x)$, $z = z(x)$
complex-valued. This sort of linearity of the differential equation will be crucial in the
study.


We are ready to tackle (11.29). Let us begin with the homogeneous case

$$ Ly = y'' + ay' + by = 0, \tag{11.32} $$

and denote by

$$ \chi(\lambda) = \lambda^2 + a\lambda + b $$

the characteristic polynomial of the differential equation, obtained by replacing
the $k$th derivative by the power $\lambda^k$, for every $k \ge 0$. Equation (11.30) suggests looking
for a solution of the form $y(x) = e^{\lambda x}$ for a suitable $\lambda$. If we do so,

$$ L(e^{\lambda x}) = \lambda^2 e^{\lambda x} + a\lambda e^{\lambda x} + b e^{\lambda x} = \chi(\lambda)\,e^{\lambda x}, $$

and the equation holds if and only if $\lambda$ is a root of the characteristic equation

$$ \lambda^2 + a\lambda + b = 0. $$

When the discriminant $\Delta = a^2 - 4b$ is non-zero, there are two distinct roots $\lambda_1$, $\lambda_2$,
to which correspond distinct solutions $y_1(x) = e^{\lambda_1 x}$ and $y_2(x) = e^{\lambda_2 x}$; roots and
relative solutions are real if $\Delta > 0$, complex-conjugate if $\Delta < 0$. When $\Delta = 0$,
there is a double root $\lambda$, hence one solution $y_1(x) = e^{\lambda x}$. Multiplicity two implies
$\chi'(\lambda) = 0$; letting $y_2(x) = xe^{\lambda x}$, we have

$$ y_2'(x) = (1 + \lambda x)\,e^{\lambda x} \qquad\text{and}\qquad y_2''(x) = (2\lambda + \lambda^2 x)\,e^{\lambda x}. $$

Substituting back into the equation we obtain

$$ L(y_2) = \chi(\lambda)\,xe^{\lambda x} + \chi'(\lambda)\,e^{\lambda x} = 0 $$

after a few algebraic steps. Therefore the function $y_2$ solves the equation, and is
other than $y_1$. In all cases, we have found two distinct solutions $y_1$, $y_2$ of (11.32).
Since $L$ is linear by (11.31), if $y_1$, $y_2$ solve (11.32) and $C_1$, $C_2$ are constants, then

$$ L(C_1 y_1 + C_2 y_2) = C_1 L(y_1) + C_2 L(y_2) = C_1 \cdot 0 + C_2 \cdot 0 = 0, $$

hence the linear combination $C_1 y_1 + C_2 y_2$ is yet another solution of the homogeneous
equation. Moreover, if $y$ denotes a solution, one can prove that there exist
two constants $C_1$, $C_2$ such that $y = C_1 y_1 + C_2 y_2$, where $y_1$, $y_2$ are the solutions
found earlier.

In conclusion, the general integral of the homogeneous equation (11.32) takes
the form

$$ y(x; C_1, C_2) = C_1 y_1(x) + C_2 y_2(x), $$

with $C_1$, $C_2$ constants and $y_1(x)$, $y_2(x)$ defined by the recipe:

if $\Delta \ne 0$, $y_1(x) = e^{\lambda_1 x}$ and $y_2(x) = e^{\lambda_2 x}$ with $\lambda_1$, $\lambda_2$ distinct roots of the
characteristic equation $\chi(\lambda) = 0$;

if $\Delta = 0$, $y_1(x) = e^{\lambda x}$ and $y_2(x) = xe^{\lambda x}$, where $\lambda$ is the double root of $\chi(\lambda) = 0$.

When $\Delta < 0$, the solution can be written using real functions, instead of
complex-conjugate ones as above. It is enough to replace $y_1(x)$, $y_2(x)$ with the
real part $e^{\lambda_r x}\cos\lambda_i x$ and the imaginary part $e^{\lambda_r x}\sin\lambda_i x$ of $y_1(x)$ respectively,
where $\lambda_1 = \bar\lambda_2 = \lambda_r + i\lambda_i$. In fact, if $y$ is a solution of the homogeneous equation,

$$ L(\operatorname{Re} y) = \operatorname{Re}(Ly) = \operatorname{Re} 0 = 0, \qquad L(\operatorname{Im} y) = \operatorname{Im}(Ly) = \operatorname{Im} 0 = 0 $$

since the coefficients are real, so $\operatorname{Re} y$ and $\operatorname{Im} y$ are solutions too.
Summarising, the general integral of the homogeneous equation (11.32) can be
expressed in terms of real functions as follows.

The case $\Delta > 0$. The characteristic equation has two distinct real roots

$$ \lambda_{1,2} = \frac{-a \pm \sqrt{\Delta}}{2} $$

and the general integral reads

$$ y(x; C_1, C_2) = C_1 e^{\lambda_1 x} + C_2 e^{\lambda_2 x}, $$

with $C_1$, $C_2$ arbitrary constants.

The case $\Delta = 0$. The characteristic equation has a double root

$$ \lambda = -\frac{a}{2} $$

and the general integral reads

$$ y(x; C_1, C_2) = (C_1 + C_2 x)\,e^{\lambda x}, \qquad C_1, C_2 \in \mathbb{R}. $$

The case $\Delta < 0$. The characteristic equation has no real roots. Defining

$$ \sigma = \lambda_r = -\frac{a}{2}, \qquad \omega = \lambda_i = \frac{\sqrt{|\Delta|}}{2}, $$

the general integral reads

$$ y(x; C_1, C_2) = e^{\sigma x}(C_1\cos\omega x + C_2\sin\omega x), \qquad C_1, C_2 \in \mathbb{R}. $$
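The three cases translate directly into a small routine that returns a pair of real solutions for given coefficients. The sketch below is illustrative: the names `fundamental_solutions` and `residual`, the evaluation point and the finite-difference check are choices made here, and the coefficient pairs are those of the homogeneous equations treated in Examples 11.15.

```python
import math

def fundamental_solutions(a, b):
    """Real solutions y1, y2 of y'' + a y' + b y = 0, following the three cases."""
    disc = a * a - 4.0 * b
    if disc > 0:                                   # two distinct real roots
        l1 = (-a - math.sqrt(disc)) / 2.0
        l2 = (-a + math.sqrt(disc)) / 2.0
        return (lambda x: math.exp(l1 * x)), (lambda x: math.exp(l2 * x))
    if disc == 0:                                  # double root lambda = -a/2
        lam = -a / 2.0
        return (lambda x: math.exp(lam * x)), (lambda x: x * math.exp(lam * x))
    sigma, omega = -a / 2.0, math.sqrt(-disc) / 2.0   # complex-conjugate roots
    return (lambda x: math.exp(sigma * x) * math.cos(omega * x),
            lambda x: math.exp(sigma * x) * math.sin(omega * x))

def residual(y, a, b, x, h=1e-4):
    """y'' + a y' + b y at x, with derivatives from central differences."""
    d1 = (y(x + h) - y(x - h)) / (2 * h)
    d2 = (y(x + h) - 2 * y(x) + y(x - h)) / (h * h)
    return d2 + a * d1 + b * y(x)

# Coefficient pairs (a, b) covering the three cases.
cases = [(1.0, -6.0), (-2.0, 1.0), (2.0, 5.0)]
worst = max(abs(residual(y, a, b, 0.7))
            for a, b in cases for y in fundamental_solutions(a, b))
print(f"largest residual: {worst:.2e}")
```

Testing `disc == 0` exactly is adequate for coefficients given as small integers; with inexact floating-point data a tolerance would be the safer choice.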


Now we are ready for the non-homogeneous equation (11.29). The general
integral can be written as

$$ y(x; C_1, C_2) = y_0(x; C_1, C_2) + y_p(x), \tag{11.33} $$

where $y_0(x; C_1, C_2)$ is the general solution of the associated homogeneous equation
(11.32), while $y_p(x)$ denotes an arbitrary particular integral of (11.29). Based on
linearity in fact,

$$ L(y_0 + y_p) = L(y_0) + L(y_p) = 0 + g = g, $$

so the right-hand side of (11.33) solves (11.29). Vice versa, if $y(x)$ is a generic
solution of (11.29), the function $y(x) - y_p(x)$ satisfies

$$ L(y - y_p) = L(y) - L(y_p) = g - g = 0, $$

so it will be of the form $y_0(x; C_1, C_2)$ for some $C_1$ and $C_2$.


Should the source term $g$ be a mixture of products of algebraic polynomials,
trigonometric and exponential functions, we can find a particular integral of the
same sort. To understand better, we start with $g(x) = p_n(x)\,e^{\alpha x}$, where $\alpha \in \mathbb{C}$ and
$p_n(x)$ is a polynomial of degree $n \ge 0$. We look for a particular solution of the form
$y_p(x) = q_N(x)\,e^{\alpha x}$, with $q_N$ an unknown polynomial of degree $N \ge n$. Substituting
the latter and its derivatives in the equation, we obtain

$$ L\bigl(q_N(x)\,e^{\alpha x}\bigr) = \bigl(\chi(\alpha)\,q_N(x) + \chi'(\alpha)\,q_N'(x) + q_N''(x)\bigr)e^{\alpha x} = p_n(x)\,e^{\alpha x}, $$

whence

$$ \chi(\alpha)\,q_N(x) + \chi'(\alpha)\,q_N'(x) + q_N''(x) = p_n(x). $$

If $\alpha$ is not a characteristic root, it suffices to choose $N = n$ and determine
the unknown coefficients of $q_n$ by comparing the polynomials on either side of the
equation; it is better to begin from the leading term and proceed to the lower-degree
monomials.

If $\alpha$ is a simple root, $\chi(\alpha) = 0$ and $\chi'(\alpha) \ne 0$; we choose $N = n+1$ and hunt
for a polynomial solution of $\chi'(\alpha)\,q_N'(x) + q_N''(x) = p_n(x)$. Since the coefficient of
$q_{n+1}$ of degree 0 is not involved in the expression, we limit ourselves to $q_{n+1}$ of
the form $q_{n+1}(x) = x\,q_n(x)$, with $q_n$ an arbitrary polynomial of degree $n$.

Finally, if $\alpha$ is a multiple root, we put $N = n+2$ and solve $q_{n+2}''(x) = p_n(x)$,
seeking $q_{n+2}$ in the form $q_{n+2}(x) = x^2 q_n(x)$, where $q_n$ is arbitrary and of degree
$n$. In the second and third cases one speaks of resonance.

When $\alpha$ is complex, $\chi(\alpha)$ and $\chi'(\alpha)$ are complex expressions, so $q_N(x)$ has to be
found among polynomials over $\mathbb{C}$, generally speaking. But as in the homogeneous
case, we can eschew complex variables by inspecting the real and imaginary parts
of $p_n(x)\,e^{\alpha x}$; with $\alpha = \mu + i\vartheta$, they are $p_n(x)\,e^{\mu x}\cos\vartheta x$ and $p_n(x)\,e^{\mu x}\sin\vartheta x$.

Our analysis has shown that if the source term $g$ is real and of the form

$$ g(x) = p_n(x)\,e^{\mu x}\cos\vartheta x \qquad\text{or}\qquad g(x) = p_n(x)\,e^{\mu x}\sin\vartheta x, \tag{11.34} $$

we can attempt to find a particular solution

$$ y_p(x) = x^m e^{\mu x}\bigl(q_{1,n}(x)\cos\vartheta x + q_{2,n}(x)\sin\vartheta x\bigr), \tag{11.35} $$


where $q_{i,n}(x)$ are algebraic polynomials of degree $n$, and $m$ is generically 0 except
in case of resonance:

i) for $\Delta > 0$: set $m = 1$ if $\vartheta = 0$ and $\mu$ coincides with either root $\lambda_1$, $\lambda_2$ of the
characteristic polynomial;

ii) for $\Delta = 0$: set $m = 2$ if $\vartheta = 0$ and $\mu$ coincides with the (double) root $\lambda$ of the
characteristic polynomial;

iii) for $\Delta < 0$: set $m = 1$ if $\mu = \sigma$ and $\vartheta = \omega$.

Substituting the particular integral (11.35) in (11.29), and comparing the terms
$x^k e^{\mu x}\sin\vartheta x$ and $x^k e^{\mu x}\cos\vartheta x$ for all $k = 0, \dots, n$, we can determine the polynomials $q_{i,n}$.

At last, if $g$ is a sum of pieces of the form (11.34), $y_p$ will be the sum of
the particular solutions corresponding to the single source terms: suppose that
$g = g_1 + g_2 + \dots + g_K$ and $y_{pk}$ solves $L(y) = g_k$ for all $k = 1, \dots, K$. Then
$y_p = y_{p1} + \dots + y_{pK}$ satisfies

$$ L(y_p) = L(y_{p1}) + \dots + L(y_{pK}) = g_1 + \dots + g_K = g, $$

and as such it solves $L(y) = g$ as well. This is the so-called principle of superposition.
With the help of a few examples the procedure will become much clearer.


Examples 11.15

i) Consider

$$ y'' + y' - 6y = g. \tag{11.36} $$

First of all, we find the general integral of the associated homogeneous equation

$$ y'' + y' - 6y = 0. \tag{11.37} $$

The characteristic equation

$$ \lambda^2 + \lambda - 6 = 0 $$

has distinct roots $\lambda_1 = -3$, $\lambda_2 = 2$, so the general integral of (11.37) is

$$ y_0(x; C_1, C_2) = C_1 e^{-3x} + C_2 e^{2x}. $$

Now we determine a particular solution to (11.36), assuming that $g(x) = 3x^2 - x + 2$.
By (11.34), $p_2(x) = 3x^2 - x + 2$ and $\mu = \vartheta = 0$. Since $\mu$ is neither $\lambda_1$ nor
$\lambda_2$, $y_p$ will have the form $y_p(x) = \alpha x^2 + \beta x + \gamma$. Substituting $y_p$, $y_p'$, $y_p''$ in (11.36)
yields

$$ -6\alpha x^2 + (2\alpha - 6\beta)x + (2\alpha + \beta - 6\gamma) = 3x^2 - x + 2. $$

The comparison of coefficients gives $\alpha = -\frac{1}{2}$, $\beta = 0$, $\gamma = -\frac{1}{2}$, so

$$ y_p(x) = -\frac{1}{2}(x^2 + 1). $$

Therefore, the general integral of (11.36) reads

$$ y(x; C_1, C_2) = C_1 e^{-3x} + C_2 e^{2x} - \frac{1}{2}(x^2 + 1). $$

Assume instead that $g(x) = e^{2x}$. In (11.34) we have $p_0(x) = 1$, $\mu = \lambda_2 = 2$,
$\vartheta = 0$. We need a $y_p$ written as $y_p(x) = \alpha x e^{2x}$. Substituting in (11.36) gives

$$ 5\alpha e^{2x} = e^{2x}, $$

hence $\alpha = \frac{1}{5}$. The general solution is then

$$ y(x; C_1, C_2) = C_1 e^{-3x} + \left(C_2 + \frac{x}{5}\right)e^{2x}. $$
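For a polynomial source term without resonance, the comparison of coefficients "from the leading term downwards" is a short recursion. The sketch below is an illustration under stated assumptions: the function name `polynomial_particular_solution` is hypothetical, the source term is a pure polynomial (so $\mu = \vartheta = 0$), and $b \ne 0$ so that 0 is not a characteristic root. Exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction

def polynomial_particular_solution(a, b, p):
    """Coefficients q[0..n] of a polynomial solving y'' + a y' + b y = p(x).

    p lists the coefficients of the source term in increasing degree; b must be
    non-zero, i.e. 0 is not a characteristic root (no resonance).
    """
    n = len(p) - 1
    q = [Fraction(0)] * (n + 3)                 # two padding zeros simplify the recursion
    for k in range(n, -1, -1):                  # leading term first, then lower degrees
        # Coefficient of x^k:  b q_k + a (k+1) q_{k+1} + (k+2)(k+1) q_{k+2} = p_k
        q[k] = (Fraction(p[k]) - a * (k + 1) * q[k + 1]
                - (k + 2) * (k + 1) * q[k + 2]) / b
    return q[: n + 1]

# Example 11.15 i): y'' + y' - 6y = 3x^2 - x + 2.
coeffs = polynomial_particular_solution(1, -6, [2, -1, 3])
print(coeffs)
```

For this input the routine returns the coefficients $-\frac12, 0, -\frac12$ in increasing degree, that is $y_p(x) = -\frac12(x^2 + 1)$, in agreement with the example.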


ii) Examine the equation

$$ y'' - 2y' + y = g. \tag{11.38} $$

The characteristic polynomial $\lambda^2 - 2\lambda + 1$ has a root $\lambda = 1$ of multiplicity two.
The general integral of the homogeneous equation is thus

$$ y_0(x; C_1, C_2) = (C_1 + C_2 x)\,e^x. $$

Suppose $g(x) = xe^{3x}$. As $\mu = 3$ is not $\lambda = 1$, we search for a particular solution
$y_p(x) = (\alpha x + \beta)\,e^{3x}$. As before, the substitution of the latter back into the
equation yields

$$ 4(\alpha x + \alpha + \beta)\,e^{3x} = xe^{3x}, $$

giving

$$ y_p(x) = \frac{1}{4}(x - 1)\,e^{3x}. $$

We conclude that the general integral is

$$ y(x; C_1, C_2) = (C_1 + C_2 x)\,e^x + \frac{1}{4}(x - 1)\,e^{3x}. $$

Taking $g(x) = -4e^x$, instead, calls for a $y_p$ of type $y_p(x) = \alpha x^2 e^x$. Then

$$ 2\alpha e^x = -4e^x $$

implies $\alpha = -2$, and the general solution reads

$$ y(x; C_1, C_2) = (C_1 + C_2 x - 2x^2)\,e^x. $$


iii) The last example is

$$y'' + 2y' + 5y = g. \qquad (11.39)$$

This ODE has characteristic equation $\lambda^2 + 2\lambda + 5 = 0$ with negative discriminant
$\Delta = -16$. From $\sigma = -1$, $\omega = 2$, the general integral of the homogeneous
equation is

$$y_0(x; C_1, C_2) = e^{-x}(C_1 \cos 2x + C_2 \sin 2x).$$

Take $g(x) = \sin x$. Referring to the left term in (11.34), we have $p_0(x) = 1$,
$\mu = 0$, $\vartheta = 1$. We want a particular integral $y_p(x) = \alpha \cos x + \beta \sin x$. Rewrite
(11.39) using the derivatives $y_p'$, $y_p''$ and $y_p$, so that

$$(4\alpha + 2\beta) \cos x + (4\beta - 2\alpha) \sin x = \sin x.$$

Compare the coefficients of $\sin x$ and $\cos x$, so $\alpha = -\frac{1}{10}$ and $\beta = \frac{1}{5}$, i.e.,

$$y_p(x) = -\frac{1}{10} \cos x + \frac{1}{5} \sin x.$$

The general solution is

$$y(x) = e^{-x}(C_1 \cos 2x + C_2 \sin 2x) - \frac{1}{10} \cos x + \frac{1}{5} \sin x.$$

Another variant is to suppose $g(x) = e^{-x} \sin 2x$. Using the first of (11.34),
$\mu = \sigma = -1$ and $\vartheta = \omega = 2$, so we look for the particular integral
$y_p(x) = x e^{-x}(\alpha \cos 2x + \beta \sin 2x)$. The substitution yields

$$e^{-x}(4\beta \cos 2x - 4\alpha \sin 2x) = e^{-x} \sin 2x,$$

hence $\alpha = -\frac{1}{4}$, $\beta = 0$, and the general solution reads

$$y(x) = e^{-x}\left(\left(C_1 - \frac{1}{4}x\right) \cos 2x + C_2 \sin 2x\right).$$
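The particular integrals found in Examples 11.15 can be checked numerically. The following is only a sketch, not part of the text: the helper `residual` is a hypothetical name, and it approximates the derivatives by central finite differences, so the residual of $y'' + ay' + by - g$ is expected to be small rather than exactly zero.

```python
import math

def residual(y, coeffs, g, x, h=1e-4):
    """Approximate y'' + a*y' + b*y - g at x with central finite differences."""
    a, b = coeffs
    d1 = (y(x + h) - y(x - h)) / (2 * h)
    d2 = (y(x + h) - 2 * y(x) + y(x - h)) / (h * h)
    return d2 + a * d1 + b * y(x) - g(x)

# Each entry: (a, b) of y'' + a y' + b y, the candidate y_p, the source term g.
cases = [
    ((1, -6), lambda x: -0.5 * (x * x + 1), lambda x: 3 * x * x - x + 2),
    ((1, -6), lambda x: x * math.exp(2 * x) / 5, lambda x: math.exp(2 * x)),
    ((-2, 1), lambda x: (x - 1) * math.exp(3 * x) / 4, lambda x: x * math.exp(3 * x)),
    ((-2, 1), lambda x: -2 * x * x * math.exp(x), lambda x: -4 * math.exp(x)),
    ((2, 5), lambda x: -math.cos(x) / 10 + math.sin(x) / 5, math.sin),
    ((2, 5), lambda x: -x * math.exp(-x) * math.cos(2 * x) / 4,
     lambda x: math.exp(-x) * math.sin(2 * x)),
]

# Largest residual over a few sample points; it should be close to zero.
worst = max(abs(residual(y, c, g, x))
            for c, y, g in cases
            for x in (-1.0, 0.0, 0.5, 1.0))
print(worst)
```

A small value of `worst` supports the computations above but does not replace them; the step `h` is an arbitrary choice.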


11.5 Exercises 


1. Determine the general integral of these ODEs with separable variables:

a) $y' = x \log(1 + x^2)$
b) $y' = \dfrac{(x + 2)y}{x(x + 1)}$
c) $y' = \dfrac{1}{x \log x}\, y^2 - \dfrac{1}{x \log x}$
d) $y' = \sqrt[3]{2y + 3}\, \tan^2 x$

2. Find the general solution of the homogeneous ODEs:

a) $4x^2 y' = y^2 + 6xy - 3x^2$
b) $x^2 y' = x^2 + 4y^2 + yx$
c) $xyy' = x^2 + y^2$
d) $x^2 y' - y^2 e^{x/y} = xy$

3. Solve in full generality the linear ODEs:

a) $y' + 3xy = x^3$
b) $y' = \dfrac{1}{x}\, y - \dfrac{3x + 2}{x^3}$
c) $y' = \dfrac{2x - y}{x - 1}$
d) $xy' = y + \dfrac{2x^2}{1 + x^2}$

4. Write the particular solution of the equation

$$y' = \frac{1 - e^{-y}}{2x + 1}$$

such that $y(0) = 1$.

5. Establish whether the differential equation

$$y' = -2y + e^{-2x}$$

has solutions with vanishing derivative at the origin.


6. Solve, over the ray $[\sqrt[4]{e}, +\infty)$, the initial value problem

$$e^{y} y' = 4x^3 \log x\, (1 + e^{y}), \qquad y(\sqrt[4]{e}) = 0.$$

7. Find the solutions of

$$y' = \frac{3x|y|}{x^2 - 4}, \qquad y(0) = -1$$

that are defined on the interval $(-2, 2)$.

8. Determine the general integral of

$$y' \sin 2x - 2(y + \cos x) = 0, \qquad x \in \left(0, \frac{\pi}{2}\right),$$

and indicate the solution that stays bounded when $x \to \frac{\pi}{2}^-$.

9. For $a \in \mathbb{R}$, solve the ODE

$$y' = (2 + a)y - 2e^{ax}$$

so that $y(0) = 3$. Tell which values of $a$ make the improper integral $\int_0^{+\infty} y(x)\, dx$
converge.

10. Let $a$, $b$ be real numbers. Solve the initial value problem

$$y' = \frac{a}{x}\, y + 3x^{b}, \qquad y(2) = 1$$

restricted to the half-line $[2, +\infty)$.

11. Consider the parametric differential equation

$$y'(x) = -3x y(x) + kx$$

depending on $k \in \mathbb{R}$.
a) Find the solution with a zero at the origin.
b) For such solution $y$ determine $k$ so that $y(x) \sim x^2$ as $x \to 0$.

12. Given the ODE

$$y' = \frac{y^2 - 2y - 3}{2(1 + 4x)},$$

determine:
a) the general integral;
b) the particular integral $y_0(x)$ satisfying $y_0(0) = 1$;
c) Maclaurin's expansion of $y_0(x)$ up to second order.


13. Work out the general solution to the following second order ODEs by reducing
them to the first order:

a) $y'' = 2e^{x}$
b) $y'' + y' - x^2 = 0$

14. Compute the general integral of the linear ODEs of the second order:

a) $y'' + 3y' + 2y = x^2 + 1$
b) $y'' - 4y' + 4y = e^{2x}$
c) $y'' + y = 3 \cos x$
d) $y'' - 3y' + 2y = e^{x}$
e) $y'' - 9y = e^{-3x}$
f) $y'' - 2y' - 3y = \sin x$

15. Solve the initial value problems:

a) $y'' + 2y' + 5y = 0$, $\quad y(0) = 0$, $\quad y'(0) = 2$
b) $y'' - 5y' + 4y = 2x + 1$, $\quad y(0) = \frac{7}{8}$, $\quad y'(0) = 0$


11.5.1 Solutions 


1. ODEs with separable variables: 
a) $y = \frac{1}{2}(1 + x^2) \log(1 + x^2) - \frac{1}{2}x^2 + C$.

b) The map $h(y) = y$ has a zero at $y = 0$, which is consequently a singular integral.
Suppose then $y \neq 0$ and separate the variables:

$$\int \frac{1}{y}\, dy = \int \frac{x + 2}{x(x + 1)}\, dx.$$

We compute the integral on the right by partial fractions:

$$\frac{x + 2}{x(x + 1)} = \frac{A}{x} + \frac{B}{x + 1} = \frac{2}{x} - \frac{1}{x + 1}$$

implies

$$\int \frac{x + 2}{x(x + 1)}\, dx = \int \left(\frac{2}{x} - \frac{1}{x + 1}\right) dx = 2 \log|x| - \log|x + 1| + \log C = \log \frac{C x^2}{|x + 1|}, \qquad C > 0.$$

Thus

$$\log|y| = \log \frac{C x^2}{|x + 1|}, \qquad C > 0,$$

$$|y| = C\, \frac{x^2}{|x + 1|}, \qquad C > 0,$$

$$y = C\, \frac{x^2}{x + 1}, \qquad C \neq 0.$$

The singular integral $y = 0$ belongs to this family if we allow the value $C = 0$.

c) The problem requires $x > 0$ due to the presence of the logarithm. Rearranging
the equation as

$$y' = \frac{y^2 - 1}{x \log x}$$

yields $h(y) = y^2 - 1$. Thus the constant maps $y = 1$, $y = -1$ are singular
solutions. Let now $y \neq \pm 1$; separating the variables returns

$$\int \frac{1}{y^2 - 1}\, dy = \int \frac{1}{x \log x}\, dx.$$

The method of partial fractions in the left integral and the substitution $t = \log x$
on the right give

$$\frac{1}{2} \log\left|\frac{y - 1}{y + 1}\right| = \log|\log x| + \log C = \log C|\log x|, \qquad C > 0,$$

equivalent to

$$\log\left|\frac{y - 1}{y + 1}\right| = \log C \log^2 x, \qquad C > 0,$$

or

$$\frac{y - 1}{y + 1} = C \log^2 x, \qquad C \neq 0.$$

Altogether, the general integral is

$$y = \frac{1 + C \log^2 x}{1 - C \log^2 x}, \qquad C \in \mathbb{R},$$

which includes the singular solution $y = 1$ for $C = 0$.

d) $y = -\frac{3}{2} \pm \frac{1}{2}\left[\frac{4}{3}(\tan x - x + C)\right]^{3/2}$, plus the constant solution $y = -\frac{3}{2}$.


2. Homogeneous differential equations: 


a) Supposing $x \neq 0$ and dividing by $4x^2$ gives

$$y' = \frac{1}{4}\frac{y^2}{x^2} + \frac{3}{2}\frac{y}{x} - \frac{3}{4}.$$

By renaming $z = \frac{y}{x}$ we have $y' = z + xz'$, hence

$$z + xz' = \frac{1}{4}z^2 + \frac{3}{2}z - \frac{3}{4},$$

$$4xz' = (z - 1)(z + 3).$$

Because $\varphi(z) = (z - 1)(z + 3)$ has zeroes $z = 1$, $z = -3$, the maps $y = x$,
$y = -3x$ are singular integrals. For the general solution let us separate the
variables

$$\int \frac{4}{(z - 1)(z + 3)}\, dz = \int \frac{1}{x}\, dx.$$

Decomposing

$$\frac{4}{(z - 1)(z + 3)} = \frac{A}{z - 1} + \frac{B}{z + 3} = \frac{1}{z - 1} - \frac{1}{z + 3},$$

the left-hand-side integral is

$$\int \frac{4}{(z - 1)(z + 3)}\, dz = \int \left(\frac{1}{z - 1} - \frac{1}{z + 3}\right) dz = \log\left|\frac{z - 1}{z + 3}\right| + c.$$

Therefore

$$\log\left|\frac{z - 1}{z + 3}\right| = \log C|x|, \qquad C > 0,$$

$$\frac{z - 1}{z + 3} = Cx, \qquad C \neq 0,$$

$$z = \frac{1 + 3Cx}{1 - Cx}, \qquad C \in \mathbb{R};$$

this incorporates the singular solution $z = 1$ as well. Returning to the variable
$y$, the general integral reads

$$y = \frac{x + 3Cx^2}{1 - Cx}, \qquad C \in \mathbb{R}.$$

b) $y = \frac{1}{2}\, x \tan(2 \log C|x|)$, $C > 0$.

c) $y = \pm x \sqrt{2 \log C|x|}$, $C > 0$.

d) If $x \neq 0$ we divide by $x^2$:

$$y' = \frac{y^2}{x^2}\, e^{x/y} + \frac{y}{x}.$$

Changing $z = \frac{y}{x}$ gives $y' = z + xz'$, so

$$z + xz' = z^2 e^{1/z} + z,$$

whence

$$xz' = z^2 e^{1/z}.$$

The function $z = 0$, corresponding to $y = 0$, is a singular integral of the ODE.
By separation of the variables

$$\int \frac{e^{-1/z}}{z^2}\, dz = \int \frac{1}{x}\, dx,$$

integrating which,

$$e^{-1/z} = \log C|x|, \qquad C > 0,$$

$$-\frac{1}{z} = \log\log C|x|, \qquad C > 0,$$

$$z = -\frac{1}{\log\log C|x|}, \qquad C > 0,$$

i.e.,

$$y = -\frac{x}{\log\log C|x|}, \qquad C > 0,$$

in terms of $y$.


3. Linear ODEs: 
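a) $y = \frac{1}{3}\left(x^2 - \frac{2}{3}\right) + C e^{-\frac{3}{2}x^2}$, $C \in \mathbb{R}$.

b) Using (11.18) with $a(x) = -\frac{1}{x}$ and $b(x) = -\frac{3x + 2}{x^3}$ produces

$$y = e^{\int \frac{1}{x}\, dx} \int e^{-\int \frac{1}{x}\, dx} \left(-\frac{3x + 2}{x^3}\right) dx = e^{\log|x|} \int e^{-\log|x|} \left(-\frac{3x + 2}{x^3}\right) dx$$

$$= |x| \int \frac{1}{|x|} \left(-\frac{3x + 2}{x^3}\right) dx = x \int \left(-\frac{3x + 2}{x^4}\right) dx = x \left(\frac{3}{2x^2} + \frac{2}{3x^3} + C\right)$$

$$= \frac{3}{2x} + \frac{2}{3x^2} + Cx, \qquad C \in \mathbb{R}.$$

c) By writing

$$y' + \frac{1}{x - 1}\, y = \frac{2x}{x - 1}$$

we recognise formula (11.18) where $a(x) = \frac{1}{x - 1}$, $b(x) = \frac{2x}{x - 1}$. Then

$$y = e^{-\int \frac{1}{x - 1}\, dx} \int e^{\int \frac{1}{x - 1}\, dx}\, \frac{2x}{x - 1}\, dx = \frac{1}{x - 1} \int 2x\, dx = \frac{1}{x - 1}(x^2 + C), \qquad C \in \mathbb{R}.$$

d) $y = 2x \arctan x + Cx$, $C \in \mathbb{R}$.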


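4. The equation has separable variables, but the constant solution $y = 0$ is not
admissible, for it does not meet the condition $y(0) = 1$. So we write

$$\int \frac{1}{1 - e^{-y}}\, dy = \int \frac{1}{2x + 1}\, dx.$$

Renaming $t = e^{-y}$ (hence $dt = -e^{-y} dy$, $-\frac{1}{t}\, dt = dy$) implies

$$\int \frac{1}{1 - e^{-y}}\, dy = \int \frac{1}{t(t - 1)}\, dt = \int \left(\frac{1}{t - 1} - \frac{1}{t}\right) dt$$

$$= \log|t - 1| - \log|t| + c = \log\left|\frac{t - 1}{t}\right| + c = \log|1 - e^{y}| + c.$$

Then

$$\log|1 - e^{y}| = \frac{1}{2} \log|2x + 1| + \log C, \qquad C > 0,$$

$$\log|1 - e^{y}| = \log C\sqrt{|2x + 1|}, \qquad C > 0,$$

$$|1 - e^{y}| = C\sqrt{|2x + 1|}, \qquad C > 0,$$

$$1 - e^{y} = C\sqrt{|2x + 1|}, \qquad C \neq 0.$$

In conclusion, the general integral

$$y = \log\left(1 - C\sqrt{|2x + 1|}\right), \qquad C \in \mathbb{R},$$

also takes into account $y = 0$, corresponding to $C = 0$.
Now to the condition $y(0) = 1$: from $C = 1 - e$ the required solution is

$$y = \log\left(1 + (e - 1)\sqrt{2x + 1}\right).$$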
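5. The general integral of the linear equation reads

$$y = e^{-\int 2\, dx} \int e^{\int 2\, dx}\, e^{-2x}\, dx = e^{-2x}(x + C), \qquad C \in \mathbb{R}.$$

The constraint is $y'(0) = 0$. Putting $x = 0$ in $y'(x) = -2y(x) + e^{-2x}$, that becomes
$y(0) = \frac{1}{2}$, and implies $C = \frac{1}{2}$. The final solution is thus

$$y = e^{-2x}\left(x + \frac{1}{2}\right).$$

6. $y = \log\left(2e^{x^4(\log x - \frac{1}{4})} - 1\right)$.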


7. When $x \in (-2, 2)$, $x^2 - 4 < 0$. The initial condition $y(0) = -1$ moreover allows us
to restrict to $y(x) < 0$ in a neighbourhood of $x = 0$. That said, we separate
variables:

$$\int \frac{1}{|y|}\, dy = \int \frac{3x}{x^2 - 4}\, dx,$$

$$-\log|y| = -\log(-y) = \frac{3}{2} \log|x^2 - 4| + C, \qquad C \in \mathbb{R},$$

$$-\frac{1}{y} = C|x^2 - 4|^{3/2}, \qquad C > 0,$$

$$y = C(4 - x^2)^{-3/2}, \qquad C < 0.$$

Since $y(0) = -1$, $C$ must be $-8$ and the solution reads

$$y = -\frac{8}{(4 - x^2)^{3/2}}.$$

Notice that the constant function $y = 0$ was disregarded because it fails to meet
$y(0) = -1$.


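8. The duplication formula $\sin 2x = 2 \sin x \cos x$ gives

$$y' \sin x \cos x = y + \cos x.$$

For $x \in \left(0, \frac{\pi}{2}\right)$ we have $\sin x \cos x \neq 0$, and so

$$y' = \frac{1}{\sin x \cos x}\, y + \frac{1}{\sin x}.$$

This is a linear equation, with integral

$$y = e^{\int \frac{1}{\sin x \cos x}\, dx} \int e^{-\int \frac{1}{\sin x \cos x}\, dx}\, \frac{1}{\sin x}\, dx.$$

Let us compute

$$S = \int \frac{1}{\sin x \cos x}\, dx$$

by setting $t = \sin x$ ($dt = \cos x\, dx$, $\cos^2 x = 1 - t^2$) and integrating the rational
function thus obtained:

$$S = \int \frac{1}{t(1 - t^2)}\, dt = \int \left(\frac{1}{t} + \frac{1}{2(1 - t)} - \frac{1}{2(1 + t)}\right) dt$$

$$= \log|t| - \frac{1}{2} \log|1 - t| - \frac{1}{2} \log|1 + t| + c$$

$$= \log \frac{|t|}{\sqrt{|1 - t^2|}} + c = \log \frac{\sin x}{\cos x} + c, \qquad x \in \left(0, \frac{\pi}{2}\right).$$

Then

$$y = \frac{\sin x}{\cos x} \int \frac{\cos x}{\sin^2 x}\, dx = \frac{\sin x}{\cos x} \left(-\frac{1}{\sin x} + C\right), \qquad C \in \mathbb{R},$$

and the solution to the ODE is

$$y = \frac{C \sin x - 1}{\cos x}, \qquad C \in \mathbb{R}.$$

We need to find a bounded solution around $\frac{\pi}{2}^-$:

$$\lim_{x \to \frac{\pi}{2}^-} \frac{C \sin x - 1}{\cos x} \in \mathbb{R}.$$

But, setting $t = x - \frac{\pi}{2}$,

$$\lim_{x \to \frac{\pi}{2}^-} \frac{C \sin x - 1}{\cos x} = \lim_{t \to 0^-} \frac{1 - C \cos t}{\sin t} = \lim_{t \to 0^-} \frac{1 - C(1 + o(t))}{t + o(t^2)} \in \mathbb{R}$$

if and only if $C = 1$, so the desired solution is

$$y = \frac{\sin x - 1}{\cos x}.$$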


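9. The equation is linear, so at once

$$y = e^{\int (2 + a)\, dx} \int e^{-\int (2 + a)\, dx}\, (-2e^{ax})\, dx = e^{(2 + a)x} \left(-2 \int e^{-2x}\, dx\right)$$

$$= e^{(2 + a)x}\left(e^{-2x} + C\right) = e^{ax}\left(1 + Ce^{2x}\right), \qquad C \in \mathbb{R}.$$

From $y(0) = 3$ it follows $3 = 1 + C$, so $C = 2$. The solution we want is thus
$y = e^{ax}(1 + 2e^{2x})$.
The improper integral

$$\int_0^{+\infty} \left(e^{ax} + 2e^{(a + 2)x}\right) dx$$

converges precisely when the larger of the two exponents, $a + 2$, is negative.
Therefore the integral converges if $a < -2$.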


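10. Directly from the formula for linear equations,

$$y = e^{a \int \frac{1}{x}\, dx} \left(3 \int e^{-a \int \frac{1}{x}\, dx}\, x^{b}\, dx\right) = x^{a} \left(3 \int x^{b - a}\, dx\right)$$

$$= \begin{cases} x^{a} \left(\dfrac{3}{b - a + 1}\, x^{b - a + 1} + C\right) & \text{if } b - a \neq -1, \\[2mm] x^{a} (3 \log x + C) & \text{if } b - a = -1, \end{cases}$$

$$= \begin{cases} \dfrac{3}{b - a + 1}\, x^{b + 1} + C x^{a} & \text{if } b - a \neq -1, \\[2mm] 3x^{a} \log x + C x^{a} & \text{if } b - a = -1. \end{cases}$$

Imposing $y(2) = 1$,

$$\begin{cases} \dfrac{3}{b - a + 1}\, 2^{b + 1} + C\, 2^{a} = 1 & \text{if } b - a \neq -1, \\[2mm] 3 \cdot 2^{a} \log 2 + C\, 2^{a} = 1 & \text{if } b - a = -1, \end{cases}$$

whence the constant is respectively

$$\begin{cases} C = 2^{-a} \left(1 - \dfrac{3}{b - a + 1}\, 2^{b + 1}\right) & \text{if } b - a \neq -1, \\[2mm] C = 2^{-a} - 3 \log 2 & \text{if } b - a = -1. \end{cases}$$

In conclusion,

$$y = \begin{cases} \dfrac{3}{b - a + 1}\, x^{b + 1} + 2^{-a} \left(1 - \dfrac{3}{b - a + 1}\, 2^{b + 1}\right) x^{a} & \text{if } b - a \neq -1, \\[2mm] 3x^{a} \log x + \left(2^{-a} - 3 \log 2\right) x^{a} & \text{if } b - a = -1. \end{cases}$$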


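11. The ODE $y'(x) = -3x y(x) + kx$:

a) The equation is linear, with integral

$$y = e^{-3 \int x\, dx} \int e^{3 \int x\, dx}\, kx\, dx = e^{-\frac{3}{2}x^2} \left(\frac{k}{3}\, e^{\frac{3}{2}x^2} + C\right) = \frac{k}{3} + C e^{-\frac{3}{2}x^2}, \qquad C \in \mathbb{R}.$$

The condition $y(0) = 0$ forces $0 = \frac{k}{3} + C$, so $C = -\frac{k}{3}$, and the solution is

$$y = \frac{k}{3} \left(1 - e^{-\frac{3}{2}x^2}\right).$$

b) The solution must now fulfill

$$\frac{k}{3} \left(1 - e^{-\frac{3}{2}x^2}\right) \sim x^2 \qquad \text{as } x \to 0.$$

But

$$e^{-\frac{3}{2}x^2} = 1 - \frac{3}{2}x^2 + o(x^2) \qquad \text{for } x \to 0$$

implies

$$y(x) = \frac{k}{3} \left(\frac{3}{2}x^2 + o(x^2)\right) = \frac{k}{2}x^2 + o(x^2) \qquad \text{for } x \to 0.$$

Therefore $y$ is fixed by $\frac{k}{2} = 1$, i.e., $k = 2$.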


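12. Solution of $y' = \dfrac{y^2 - 2y - 3}{2(1 + 4x)}$:

a) $y(x) = \dfrac{3 + C\sqrt{|1 + 4x|}}{1 - C\sqrt{|1 + 4x|}}$ with $C \in \mathbb{R}$, and the constant $y(x) = -1$.

b) $y_0(x) = \dfrac{3 - \sqrt{|1 + 4x|}}{1 + \sqrt{|1 + 4x|}}$.

c) $T_2(x) = 1 - 2x + 4x^2 + o(x^2)$.

13. Second order linear ODEs reducible to first order:

a) $y = 2e^{x} + C_1 x + C_2$, $C_1, C_2 \in \mathbb{R}$.

b) We define $z = y'$ so as to obtain the linear equation of the first order

$$z' + z = x^2,$$

solved by

$$z = e^{-\int dx} \int e^{\int dx}\, x^2\, dx = e^{-x} \int x^2 e^{x}\, dx.$$

By integration by parts (twice),

$$z = e^{-x}\left(x^2 e^{x} - 2x e^{x} + 2e^{x} + C_1\right) = x^2 - 2x + 2 + C_1 e^{-x}, \qquad C_1 \in \mathbb{R}.$$

Integrating once more to go back to $y$ gives (renaming the arbitrary constant $C_1$)

$$y = \frac{1}{3}x^3 - x^2 + 2x + C_1 e^{-x} + C_2, \qquad C_1, C_2 \in \mathbb{R}.$$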


14. Linear ODEs of the second order: 


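a) $y = C_1 e^{-x} + C_2 e^{-2x} + \frac{1}{2}x^2 - \frac{3}{2}x + \frac{9}{4}$, $C_1, C_2 \in \mathbb{R}$.

b) Let us solve first the homogeneous equation. The characteristic equation
$\lambda^2 - 4\lambda + 4 = 0$ admits a unique root $\lambda = 2$ with multiplicity two; the integral
is then

$$y_0(x; C_1, C_2) = (C_1 + C_2 x) e^{2x}, \qquad C_1, C_2 \in \mathbb{R}.$$

As $\mu = \lambda = 2$, we require the particular integral to resemble $y_p(x) = \alpha x^2 e^{2x}$.
Differentiating and substituting,

$$2\alpha e^{2x} = e^{2x}$$

forces $\alpha = \frac{1}{2}$. Thus $y_p(x) = \frac{1}{2}x^2 e^{2x}$, and the general solution to the ODE is

$$y(x; C_1, C_2) = (C_1 + C_2 x) e^{2x} + \frac{1}{2}x^2 e^{2x}, \qquad C_1, C_2 \in \mathbb{R}.$$

c) The characteristic equation $\lambda^2 + 1 = 0$ has discriminant $\Delta = -4$, hence $\sigma = 0$,
$\omega = 1$, making

$$y_0(x; C_1, C_2) = C_1 \cos x + C_2 \sin x, \qquad C_1, C_2 \in \mathbb{R},$$

the general solution of the homogeneous case. Since $\mu = \sigma = 0$ and $\vartheta = \omega = 1$ we want a
particular integral $y_p(x) = x(\alpha \cos x + \beta \sin x)$. This gives

$$-2\alpha \sin x + 2\beta \cos x = 3 \cos x,$$

hence $\alpha = 0$ and $\beta = \frac{3}{2}$, and in turn $y_p(x) = \frac{3}{2}\, x \sin x$. Thus

$$y(x; C_1, C_2) = C_1 \cos x + C_2 \sin x + \frac{3}{2}\, x \sin x, \qquad C_1, C_2 \in \mathbb{R}.$$

d) $y = C_1 e^{x} + C_2 e^{2x} - x e^{x}$, $C_1, C_2 \in \mathbb{R}$.

e) $\lambda = \pm 3$ solve the characteristic equation $\lambda^2 - 9 = 0$, so

$$y_0(x; C_1, C_2) = C_1 e^{-3x} + C_2 e^{3x}, \qquad C_1, C_2 \in \mathbb{R},$$

is the integral of the homogeneous equation. We are seeking a
particular integral $y_p(x) = \alpha x e^{-3x}$. In the usual way

$$-6\alpha e^{-3x} = e^{-3x},$$

from which $\alpha = -\frac{1}{6}$ follows. The particular solution $y_p(x) = -\frac{1}{6}\, x e^{-3x}$ is
assimilated into the general integral

$$y(x; C_1, C_2) = C_1 e^{-3x} + C_2 e^{3x} - \frac{1}{6}\, x e^{-3x}, \qquad C_1, C_2 \in \mathbb{R}.$$

f) $y = C_1 e^{-x} + C_2 e^{3x} + \frac{1}{10} \cos x - \frac{1}{5} \sin x$, $C_1, C_2 \in \mathbb{R}$.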


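15. Initial value problems:

a) $y = e^{-x} \sin 2x$.

b) We start from the homogeneous ODE, and solve the characteristic equation
$\lambda^2 - 5\lambda + 4 = 0$, which has roots $\lambda = 1$, $\lambda = 4$. In this way

$$y_0(x; C_1, C_2) = C_1 e^{x} + C_2 e^{4x}, \qquad C_1, C_2 \in \mathbb{R},$$

is the general expression for the integral. A particular solution $y_p(x) = \alpha x + \beta$
furnishes

$$-5\alpha + 4\alpha x + 4\beta = 2x + 1,$$

so $\alpha = \frac{1}{2}$, $\beta = \frac{7}{8}$. In this way we subsume $y_p(x) = \frac{1}{2}x + \frac{7}{8}$ into the general
integral

$$y(x; C_1, C_2) = C_1 e^{x} + C_2 e^{4x} + \frac{1}{2}x + \frac{7}{8}, \qquad C_1, C_2 \in \mathbb{R}.$$

The initial conditions lead to the system

$$\begin{cases} C_1 + C_2 = 0 \\ C_1 + 4C_2 + \frac{1}{2} = 0. \end{cases}$$

Its solutions $C_1 = \frac{1}{6}$, $C_2 = -\frac{1}{6}$ now give

$$y = \frac{1}{6}e^{x} - \frac{1}{6}e^{4x} + \frac{1}{2}x + \frac{7}{8}.$$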

Appendices 


A.1 


The Principle of Mathematical Induction 


The Principle of Induction represents a useful rule for proving properties that hold
for any integer number $n$, or possibly from a certain integer $n_0 \in \mathbb{N}$ onwards.


Theorem A.1.1 (Principle of Mathematical Induction) Let $n_0 \geq 0$ be
an integer and denote by $P(n)$ a predicate defined for every integer $n \geq n_0$.
Suppose the following conditions hold:

i) $P(n_0)$ is true;
ii) for any $n \geq n_0$, if $P(n)$ is true then $P(n + 1)$ is true.

Then $P(n)$ is true for all integers $n \geq n_0$.


Proof. The proof relies on the fact that each non-empty subset of N admits a 
minimum element; this property, which is self-evident, may be deduced from the 
axioms that define the set N. 

Let us proceed by contradiction, and assume there is an integer $n \geq n_0$ such
that $P(n)$ is false. This is the same as saying that the set

$$F = \{n \in \mathbb{N} : n \geq n_0 \text{ and } P(n) \text{ is false}\}$$

is not empty. Define $\bar{n} = \min F$. As $P(\bar{n})$ is false, condition i) prevents $\bar{n}$ from
being equal to $n_0$, so $\bar{n} > n_0$. Therefore $\bar{n} - 1 \geq n_0$, and $P(\bar{n} - 1)$ is true by
definition of the minimum. But applying ii) with $n = \bar{n} - 1$ implies that $P(\bar{n})$ is
true, that is, $\bar{n} \notin F$. This contradicts the fact that $\bar{n}$ is the minimum of $F$.


In practice, the Principle of Induction is employed as follows: one checks first 
that P(no) is true; then one assumes that P(n) is true for a generic n, and proves 
that P(n + 1) is true as well. 


As a first application of the Principle of Induction, let us prove Bernoulli's
inequality: For all $r > -1$,

$$(1 + r)^n \geq 1 + nr, \qquad \forall n \geq 0.$$


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_Al, 
© Springer International Publishing Switzerland 2015 




In this case, the predicate $P(n)$ is given by "$(1 + r)^n \geq 1 + nr$". For $n = 0$ we have
$(1 + r)^0 = 1 = 1 + 0r$, hence $P(0)$ holds.

Assume the inequality is true for a given $n$ and let us show it holds for $n + 1$.
Observing that $1 + r > 0$, we have

$$(1 + r)^{n+1} = (1 + r)(1 + r)^n \geq (1 + r)(1 + nr) = 1 + r + nr + nr^2 = 1 + (n + 1)r + nr^2 \geq 1 + (n + 1)r,$$

and thus the result.
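Bernoulli's inequality can also be spot-checked with exact rational arithmetic. The sketch below is not a proof; the helper name `bernoulli_holds` and the sample values are assumptions made for illustration.

```python
from fractions import Fraction

def bernoulli_holds(r, n):
    """Check (1 + r)**n >= 1 + n*r exactly, for a rational r and an integer n >= 0."""
    r = Fraction(r)
    return (1 + r) ** n >= 1 + n * r

# A few rational values of r > -1 and all exponents n = 0, ..., 29.
samples = [Fraction(-9, 10), Fraction(-1, 2), Fraction(0), Fraction(1, 3), Fraction(5)]
all_ok = all(bernoulli_holds(r, n) for r in samples for n in range(30))
print(all_ok)
```

Outside the hypothesis the inequality can fail: `bernoulli_holds(Fraction(-4), 3)` compares $(-3)^3 = -27$ with $1 - 12 = -11$, so some restriction on $r$ is needed; the proof above uses $1 + r > 0$.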


Recall that this inequality has been already established in Example 5.18 with 
another proof, which however is restricted to the case r > 0. 


The Principle of Induction allows us to prove various results given in previous 
chapters. Hereafter, we repeat their statements and add the corresponding proofs. 


> Proof of the Newton binomial expansion, p. 20 


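$$(a + b)^n = \sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^{k}, \qquad n \geq 0.$$

Proof. We proceed by induction. For $n = 0$ we have

$$(a + b)^0 = 1 \qquad \text{and} \qquad \binom{0}{0} a^0 b^0 = 1,$$

so the relation holds.
Let us now suppose that the formula is true for a generic $n$ and verify that it
remains true for the successive integer; the claim is thus

$$(a + b)^{n+1} = \sum_{k=0}^{n+1} \binom{n+1}{k} a^{n+1-k} b^{k}.$$

We expand the $(n + 1)$-th power

$$(a + b)^{n+1} = (a + b)(a + b)^n = (a + b) \sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^{k} = \sum_{k=0}^{n} \binom{n}{k} a^{n+1-k} b^{k} + \sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^{k+1};$$

by putting $k + 1 = h$ in the second sum we obtain

$$\sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^{k+1} = \sum_{h=1}^{n+1} \binom{n}{h-1} a^{n+1-h} b^{h}$$

and, going back to the original variable $k$, since $h$ is merely a symbol, we obtain

$$\sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^{k+1} = \sum_{k=1}^{n+1} \binom{n}{k-1} a^{n+1-k} b^{k}.$$

Therefore

$$(a + b)^{n+1} = \sum_{k=0}^{n} \binom{n}{k} a^{n+1-k} b^{k} + \sum_{k=1}^{n+1} \binom{n}{k-1} a^{n+1-k} b^{k}$$

$$= \binom{n}{0} a^{n+1} b^{0} + \sum_{k=1}^{n} \left[\binom{n}{k} + \binom{n}{k-1}\right] a^{n+1-k} b^{k} + \binom{n}{n} a^{0} b^{n+1}.$$

Using (1.12), with $n$ replaced by $n + 1$, and recalling that

$$\binom{n}{0} = 1 = \binom{n+1}{0} \qquad \text{and} \qquad \binom{n}{n} = 1 = \binom{n+1}{n+1},$$

we eventually find

$$(a + b)^{n+1} = \binom{n+1}{0} a^{n+1} b^{0} + \sum_{k=1}^{n} \binom{n+1}{k} a^{n+1-k} b^{k} + \binom{n+1}{n+1} a^{0} b^{n+1} = \sum_{k=0}^{n+1} \binom{n+1}{k} a^{n+1-k} b^{k},$$

i.e., the claim.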

> Proof of the Theorem of existence of zeroes, p. 109 


Theorem 4.23 (Existence of zeroes) Let $f$ be a continuous map on a
closed, bounded interval $[a, b]$. If $f(a) f(b) < 0$, i.e., if the images of the
end-points under $f$ have different signs, $f$ admits a zero within the open
interval $(a, b)$.
If moreover $f$ is strictly monotone on $[a, b]$, the zero is unique.

Proof. We refer to the proof given on p. 109. Therein, it is enough to justify the
existence of two sequences $\{a_n\}$ and $\{b_n\}$, finite or infinite, that fulfill the predicate
$P(n)$:

$$[a_0, b_0] \supset [a_1, b_1] \supset \ldots \supset [a_n, b_n],$$

$$f(a_n) < 0 < f(b_n) \qquad \text{and} \qquad b_n - a_n = \frac{b_0 - a_0}{2^n}.$$

When $n = 0$, by assumption $f(a_0) = f(a) < 0 < f(b) = f(b_0)$, so trivially we have

$$b_0 - a_0 = \frac{b_0 - a_0}{2^0}.$$

Assume the above relations hold up to a certain $n$. Let $c_n = \frac{a_n + b_n}{2}$ be the
mid-point of the interval $[a_n, b_n]$. If $f(c_n) = 0$, the construction of the sequences
terminates, since a zero of the function is found. If $f(c_n) \neq 0$, let us verify $P(n + 1)$.
If $f(c_n) > 0$, we set $a_{n+1} = a_n$ and $b_{n+1} = c_n$, whereas if $f(c_n) < 0$, we set
$a_{n+1} = c_n$ and $b_{n+1} = b_n$. The interval $[a_{n+1}, b_{n+1}]$ is a sub-interval of $[a_n, b_n]$,
and

$$f(a_{n+1}) < 0 < f(b_{n+1}) \qquad \text{and} \qquad b_{n+1} - a_{n+1} = \frac{b_n - a_n}{2} = \frac{b_0 - a_0}{2^{n+1}}.$$
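The construction of the sequences $\{a_n\}$ and $\{b_n\}$ is the bisection procedure, and can be sketched in a few lines. The function name `bisect_zero`, the tolerance and the step limit are assumptions made for this illustration; the sketch takes $f(a) < 0 < f(b)$ as in the proof.

```python
def bisect_zero(f, a, b, tol=1e-12, max_steps=200):
    """Approximate a zero of a continuous f on [a, b], assuming f(a) < 0 < f(b)."""
    if not (f(a) < 0 < f(b)):
        raise ValueError("need f(a) < 0 < f(b)")
    for _ in range(max_steps):
        c = (a + b) / 2          # mid-point c_n of [a_n, b_n]
        fc = f(c)
        if fc == 0:              # a zero has been found: the construction stops
            return c
        if fc > 0:               # keep the left half: b_{n+1} = c_n
            b = c
        else:                    # keep the right half: a_{n+1} = c_n
            a = c
        if b - a < tol:          # the interval length halves at every step
            break
    return (a + b) / 2

# Example: the positive zero of x^2 - 2 on [0, 2].
root = bisect_zero(lambda x: x * x - 2, 0.0, 2.0)
print(root)
```

In floating-point arithmetic the returned value is only an approximation of the zero, within roughly `tol` of it.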


> Proof of inequality (5.16), p. 140 
Let us begin by establishing the following general property. 


Property A.1.2 Let $\{b_m\}_{m \geq 0}$ be a sequence with non-negative terms. Assume
there exists a number $r > 0$ for which the following inequalities hold:

$$b_{m+1} \leq r\, b_m, \qquad \forall m \geq 0.$$

Then one has

$$b_m \leq r^m b_0, \qquad \forall m \geq 0.$$

Proof. We apply the Principle of Induction. For $m = 0$, the inequality is trivially
true, since $b_0 \leq r^0 b_0 = b_0$.

Let us assume the inequality to be true for $m$ and let us check it for $m + 1$.
Using the assumption, one has

$$b_{m+1} \leq r\, b_m \leq r\, r^m b_0 = r^{m+1} b_0.$$

If all terms of the sequence $\{b_m\}_{m \geq 0}$ are strictly positive, a similar statement holds
with strict inequalities, i.e., with $\leq$ replaced by $<$.

Next, consider inequality (5.16). In order to derive the implication

$$a_{n+1} < r\, a_n, \quad \forall n > n_\varepsilon \qquad \Longrightarrow \qquad a_{n+1} < r^{n - n_\varepsilon} a_{n_\varepsilon + 1},$$

let us set $b_m = a_{m + n_\varepsilon + 1}$ and observe that the assumption

$$a_{n+1} < r\, a_n, \qquad \forall n > n_\varepsilon$$

is equivalent to

$$b_{m+1} < r\, b_m, \qquad \forall m \geq 0.$$

Thus, the previous property yields $b_m < r^m b_0$, whence we get (5.16) by choosing
$m = n - n_\varepsilon$.

A.2 


Complements on limits and continuity 


In this appendix, we first state and prove some results about limits, that are used 
in the subsequent proof of Theorem 4.10 concerning the algebra of limits. Next, we 
rigorously justify the limit behaviour of the most important elementary functions 
at the extrema of their domain, and we check the continuity of these functions at 
all points they are defined. At last, we provide the proofs of several properties of 
Napier’s number stated in the text, and we show that this important number is 
irrational. 


A.2.1 Limits 


Now we discuss a few results that will be useful later.

Theorem A.2.1 (local boundedness) If a map $f$ admits a finite limit for
$x \to c$, there exist a neighbourhood $I(c)$ of $c$ and a constant $M_f > 0$ such that

$$\forall x \in \operatorname{dom} f \cap I(c) \setminus \{c\}, \qquad |f(x)| \leq M_f.$$

Proof. Let $\ell = \lim_{x \to c} f(x) \in \mathbb{R}$; the definition of limit with, say, $\varepsilon = 1$, implies the
existence of a neighbourhood $I(c)$ of $c$ such that

$$\forall x \in \operatorname{dom} f, \qquad x \in I(c) \setminus \{c\} \implies |f(x) - \ell| < 1.$$

By the triangle inequality (1.1), on such set

$$|f(x)| = |f(x) - \ell + \ell| \leq |f(x) - \ell| + |\ell| < 1 + |\ell|.$$

Therefore it is enough to choose $M_f = 1 + |\ell|$.


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_A2, 
© Springer International Publishing Switzerland 2015 


Theorem A.2.2 (Theorem 4.2, strong form) If $f$ admits non-zero limit
(finite or infinite) for $x \to c$, then there are a neighbourhood $I(c)$ of $c$ and a
constant $K_f > 0$ for which

$$\forall x \in \operatorname{dom} f \cap I(c) \setminus \{c\}, \qquad |f(x)| > K_f. \qquad (A.2.1)$$

Proof. Let $\ell = \lim_{x \to c} f(x)$. If $\ell \in \mathbb{R} \setminus \{0\}$, and given for instance $\varepsilon = |\ell|/2$ in the
definition of limit for $f$, there exists a neighbourhood $I(c)$ with $\forall x \in \operatorname{dom} f \cap I(c) \setminus
\{c\}$, $|f(x) - \ell| < |\ell|/2$. Thus we have

$$|\ell| = |f(x) + \ell - f(x)| \leq |f(x)| + |f(x) - \ell| < |f(x)| + \frac{|\ell|}{2},$$

hence

$$|f(x)| > |\ell| - \frac{|\ell|}{2} = \frac{|\ell|}{2}.$$

The claim follows by taking $K_f = \frac{|\ell|}{2}$.

If $\ell = \pm\infty$, then $\lim_{x \to c} |f(x)| = +\infty$ and it is sufficient to take $A = 1$ in the
definition of limit to have $|f(x)| > 1$ in a neighbourhood $I(c)$ of $c$; in this case we
may take in fact $K_f = 1$.


Remark A.2.3 Notice that if $\ell > 0$, Theorem 4.2 ensures that on a suitable
neighbourhood of $c$, possibly excluding $c$ itself, the function is positive. Therefore
the inequality in (A.2.1) becomes the more precise $f(x) > K_f$. Similarly for $\ell < 0$,
in which case (A.2.1) reads $f(x) < -K_f$. In this sense Theorem A.2.2 is stronger
than Theorem 4.2.

The next property makes checking a limit an easier task.

Property A.2.4 In order to prove that $\lim_{x \to c} f(x) = \ell \in \mathbb{R}$ it is enough to find
a constant $C > 0$ such that for every $\varepsilon > 0$ there is a neighbourhood $I(c)$ with

$$\forall x \in \operatorname{dom} f, \qquad x \in I(c) \setminus \{c\} \implies |f(x) - \ell| < C\varepsilon. \qquad (A.2.2)$$

Proof. Condition (3.8) follows indeed from (A.2.2) by choosing $\varepsilon/C$ instead of $\varepsilon$.


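> Proof of Theorem 4.10, p. 96

Theorem 4.10 Suppose $f$ admits limit $\ell$ (finite or infinite) and $g$ admits limit
$m$ (finite or infinite) for $x \to c$. Then

$$\lim_{x \to c} [f(x) \pm g(x)] = \ell \pm m, \qquad \lim_{x \to c} [f(x)\, g(x)] = \ell\, m, \qquad \lim_{x \to c} \frac{f(x)}{g(x)} = \frac{\ell}{m},$$

provided the right-hand-side expressions make sense. (In the last case one
assumes $g(x) \neq 0$ on some $I(c) \setminus \{c\}$.)

Proof. The cases we shall prove are:
a) if $\ell \in \mathbb{R}$ and $m = -\infty$, then $\lim_{x \to c} (f(x) - g(x)) = +\infty$;
b) if $\ell, m \in \mathbb{R}$, then $\lim_{x \to c} f(x) g(x) = \ell m \in \mathbb{R}$;
c) if $\ell, m \in \mathbb{R}$ and $m \neq 0$, then $\lim_{x \to c} \frac{f(x)}{g(x)} = \frac{\ell}{m} \in \mathbb{R}$;
d) if $\ell \in \mathbb{R} \setminus \{0\}$ or $\ell = \pm\infty$, and $m = 0$, then $\lim_{x \to c} \frac{f(x)}{g(x)} = \infty$.

All remaining possibilities are left to the reader as an exercise.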


a) Let $A > 0$ be arbitrarily fixed.

By Theorem A.2.1 applied to $f$, there is a neighbourhood $I'(c)$ of $c$ and there is a
constant $M_f > 0$ such that for each $x \in \operatorname{dom} f \cap I'(c) \setminus \{c\}$, $|f(x)| \leq M_f$.
Moreover, $\lim_{x \to c} g(x) = -\infty$ is the same as saying that for any $B > 0$ there is an
$I''(c)$ such that $g(x) < -B$ for every $x \in \operatorname{dom} g \cap I''(c) \setminus \{c\}$, i.e., $-g(x) > B$. Choose
$B = A + M_f$, and set $I(c) = I'(c) \cap I''(c)$: then for all $x \in \operatorname{dom} f \cap \operatorname{dom} g \cap I(c) \setminus \{c\}$,

$$f(x) - g(x) > -M_f + B = A.$$

This proves that $\lim_{x \to c} (f(x) - g(x)) = +\infty$.


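b) Fix $\varepsilon > 0$.
Assuming $\lim_{x \to c} f(x) = \ell \in \mathbb{R}$, as we have, tells

$$\exists I'(c) : \forall x \in \operatorname{dom} f, \quad x \in I'(c) \setminus \{c\} \implies |f(x) - \ell| < \varepsilon,$$

while Theorem A.2.1 gives

$$\exists I''(c),\ \exists M_f > 0 : \forall x \in \operatorname{dom} f, \quad x \in I''(c) \setminus \{c\} \implies |f(x)| \leq M_f.$$

Analogously, $\lim_{x \to c} g(x) = m \in \mathbb{R}$ implies

$$\exists I'''(c) : \forall x \in \operatorname{dom} g, \quad x \in I'''(c) \setminus \{c\} \implies |g(x) - m| < \varepsilon.$$

Set now $I(c) = I'(c) \cap I''(c) \cap I'''(c)$; for all $x \in \operatorname{dom} f \cap \operatorname{dom} g \cap I(c) \setminus \{c\}$ we have

$$|f(x) g(x) - \ell m| = |f(x)(g(x) - m) + (f(x) - \ell) m| \leq |f(x)|\,|g(x) - m| + |f(x) - \ell|\,|m| < (M_f + |m|)\varepsilon.$$

This means that (A.2.2) holds with $C = M_f + |m|$.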


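c) Let $\varepsilon > 0$ be given.
From $\lim_{x \to c} f(x) = \ell \in \mathbb{R}$ and $\lim_{x \to c} g(x) = m \in \mathbb{R}$ it follows

$$\exists I'(c) : \forall x \in \operatorname{dom} f, \quad x \in I'(c) \setminus \{c\} \implies |f(x) - \ell| < \varepsilon$$

and

$$\exists I''(c) : \forall x \in \operatorname{dom} g, \quad x \in I''(c) \setminus \{c\} \implies |g(x) - m| < \varepsilon.$$

Since $m \neq 0$ moreover, Theorem A.2.2 guarantees there is a neighbourhood $I'''(c)$
of $c$ together with a constant $K_g > 0$ such that $|g(x)| > K_g$, $\forall x \in \operatorname{dom} g$, $x \in
I'''(c) \setminus \{c\}$.

Set $I(c) = I'(c) \cap I''(c) \cap I'''(c)$; then for all $x \in \operatorname{dom} f \cap \operatorname{dom} g$, $x \in I(c) \setminus \{c\}$

$$\left|\frac{f(x)}{g(x)} - \frac{\ell}{m}\right| = \frac{|f(x) m - \ell g(x)|}{|m|\,|g(x)|} = \frac{|f(x) m - \ell m + \ell m - \ell g(x)|}{|m|\,|g(x)|}$$

$$= \frac{|(f(x) - \ell) m + \ell (m - g(x))|}{|m|\,|g(x)|} \leq \frac{|f(x) - \ell|\,|m| + |\ell|\,|g(x) - m|}{|m|\,|g(x)|} < \frac{|m| + |\ell|}{|m| K_g}\, \varepsilon.$$

Hence (A.2.2) holds for $C = \frac{|m| + |\ell|}{|m| K_g}$.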


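d) Fix a constant $A > 0$.

Using Theorem A.2.2 on $f$ we know there are a neighbourhood $I'(c)$ of $c$ and a
$K_f > 0$ such that $\forall x \in \operatorname{dom} f$, $x \in I'(c) \setminus \{c\}$, $|f(x)| > K_f$.

By hypothesis $\lim_{x \to c} g(x) = 0$, so choosing $\varepsilon = K_f/A$ ensures that there exists a
neighbourhood $I''(c)$ of $c$ with $|g(x)| < K_f/A$, for any $x \in \operatorname{dom} g \cap I''(c) \setminus \{c\}$.
Now let $I(c) = I'(c) \cap I''(c)$, so that for all $x \in \operatorname{dom} f \cap \operatorname{dom} g \cap I(c) \setminus \{c\}$

$$\left|\frac{f(x)}{g(x)}\right| > K_f\, \frac{A}{K_f} = A.$$

This shows that $\lim_{x \to c} \left|\frac{f(x)}{g(x)}\right| = +\infty$, which was our claim.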




A.2.2 Elementary functions 


> Check of the limits in the table on p. 101 


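a) $\lim_{x \to +\infty} x^{\alpha} = +\infty$, $\quad \lim_{x \to 0^+} x^{\alpha} = 0$, $\quad \alpha > 0$

b) $\lim_{x \to +\infty} x^{\alpha} = 0$, $\quad \lim_{x \to 0^+} x^{\alpha} = +\infty$, $\quad \alpha < 0$

c) $\lim_{x \to \pm\infty} \dfrac{a_n x^n + \ldots + a_1 x + a_0}{b_m x^m + \ldots + b_1 x + b_0} = \dfrac{a_n}{b_m} \lim_{x \to \pm\infty} x^{n-m}$

d) $\lim_{x \to +\infty} a^{x} = +\infty$, $\quad \lim_{x \to -\infty} a^{x} = 0$, $\quad a > 1$

e) $\lim_{x \to +\infty} a^{x} = 0$, $\quad \lim_{x \to -\infty} a^{x} = +\infty$, $\quad a < 1$

f) $\lim_{x \to +\infty} \log_a x = +\infty$, $\quad \lim_{x \to 0^+} \log_a x = -\infty$, $\quad a > 1$

g) $\lim_{x \to +\infty} \log_a x = -\infty$, $\quad \lim_{x \to 0^+} \log_a x = +\infty$, $\quad a < 1$

h) $\lim_{x \to \pm\infty} \sin x$, $\lim_{x \to \pm\infty} \cos x$, $\lim_{x \to \pm\infty} \tan x$ do not exist

i) $\lim_{x \to (\frac{\pi}{2} + k\pi)^{\pm}} \tan x = \mp\infty$, $\quad \forall k \in \mathbb{Z}$

l) $\lim_{x \to \pm 1} \arcsin x = \pm\frac{\pi}{2} = \arcsin(\pm 1)$

m) $\lim_{x \to +1} \arccos x = 0 = \arccos 1$, $\quad \lim_{x \to -1} \arccos x = \pi = \arccos(-1)$

n) $\lim_{x \to \pm\infty} \arctan x = \pm\frac{\pi}{2}$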

Proof. 


a) Take the first limit. Fix $A > 0$ and set $B = A^{1/\alpha} > 0$. As power functions are
monotone,

$$\forall x \in \mathbb{R}_+, \qquad x > B \implies x^{\alpha} > B^{\alpha} = A,$$

so the requirement for the limit to hold (Definition 3.12) is satisfied.

As for the second limit, with a given $\varepsilon > 0$ we let $\delta = \varepsilon^{1/\alpha}$; again by
monotonicity we have

$$\forall x \in \mathbb{R}_+, \qquad x < \delta \implies 0 < x^{\alpha} < \delta^{\alpha} = \varepsilon.$$

The condition of Definition 3.15 holds.

b) These limits follow from a) by substituting $z = \frac{1}{x}$, which gives $x^{\alpha} = \frac{1}{z^{|\alpha|}}$. The
algebra of limits and the Substitution theorem 4.15 allow us to conclude.

c) The formula was proved in Example 4.14 iii).

d) Put $a = 1 + b$, with $b > 0$, in the first limit and use Bernoulli's inequality
$a^n = (1 + b)^n \geq 1 + nb$. Fix an arbitrary $A > 0$ and let $n \in \mathbb{N}$ be such that
$1 + nb > A$. Since the exponential is monotone we obtain

$$\forall x \in \mathbb{R}, \qquad x > n \implies a^{x} > a^{n} \geq 1 + nb > A,$$

hence the condition of Definition 3.12 holds for $B = n$.
The second limit is a consequence of the first, for

$$\lim_{x \to -\infty} a^{x} = \lim_{x \to -\infty} \frac{1}{a^{-x}} = \frac{1}{\lim_{z \to +\infty} a^{z}} = 0.$$

e) These descend from d) using the identity $a^{x} = \dfrac{1}{(1/a)^{x}}$.

f) The limits of d) and Corollary 4.30 imply that the range of $y = a^{x}$ is the interval
$(0, +\infty)$. Therefore the inverse $y = \log_a x$ is well defined on $(0, +\infty)$, and
strictly increasing because it is the inverse of a strictly increasing map; its range is $(-\infty, +\infty)$.
The claim then follows from Theorem 3.27.

g) A consequence of e), for the same reason as above.

h) That the first limit does not exist was already observed in Remark 4.19. In a
similar way one can discuss the remaining cases.

More generally, notice that a non-constant periodic function does not admit
limit for $x \to \pm\infty$.

i) Follows from the algebra of limits.

l)-m) The functions are continuous at the limit points (Theorem 4.33), making the
results clear.

n) We can argue as in f) relatively to $y = \tan x$, restricted to $\left(-\frac{\pi}{2}, \frac{\pi}{2}\right)$, and to its
inverse map $y = \arctan x$.


> Proof of Proposition 3.20, p. 81 


Proposition 3.20 All elementary functions (polynomials, rational functions, 


powers, trigonometric functions, exponentials and their inverses) are continu- 
ous over their entire domains. 


Proof. The continuity of rational functions was established in Corollary 4.12.
That, together with Theorems 4.17 and 4.33 on composites and inverses, implies in
particular that power functions with positive rational exponent $y = x^{m/n} = \sqrt[n]{x^m}$
are continuous; the same holds for negative rational exponent $x^{-q} = \frac{1}{x^{q}}$ by using
the algebra of limits. At last, powers with irrational exponent are continuous by
definition $x^{\alpha} = e^{\alpha \log x}$ and because of Theorem 4.17, once the logarithm and the
exponential function have been proven continuous.

As for sine and cosine, their continuity was ascertained in Example 3.17 iii), so
the algebra of limits warrants continuity to the tangent and cotangent functions; by
Theorem 4.33 we infer that the inverse trigonometric functions arcsine, arccosine,
arctangent and arccotangent are continuous as well.

What remains to show is then the continuity of the exponential map only,
because that of the logarithm will follow from Theorem 4.33. Let us consider the
case $a > 1$, for if $0 < a < 1$, we can write $a^{x} = \frac{1}{(1/a)^{x}}$ and use the same argument.
The identities

$$a^{x_1 + x_2} = a^{x_1} a^{x_2}, \qquad a^{-x} = \frac{1}{a^{x}}$$

and the monotonicity

$$x_1 < x_2 \implies a^{x_1} < a^{x_2}$$

follow easily from the properties of integer powers and their inverses when the
exponents are rational; for real exponents, we can apply the same argument using
the definitions of exponential function and supremum.
First of all let us prove that $y = a^{x}$ is continuous at the right of the origin

$$\lim_{x \to 0^+} a^{x} = 1. \qquad (A.2.3)$$

With $\varepsilon > 0$ fixed, we shall determine a $\delta > 0$ such that

$$0 < x < \delta \implies 0 \leq a^{x} - 1 < \varepsilon.$$

The exponential map being monotone, it suffices to find $\delta$ with $a^{\delta} - 1 < \varepsilon$, i.e.,
$a^{\delta} < 1 + \varepsilon$. Searching for $\delta$ of the form $\delta = \frac{1}{n}$, with $n$ integer, the condition
becomes $a < (1 + \varepsilon)^n$. Bernoulli's inequality (5.15) implies $(1 + \varepsilon)^n \geq 1 + n\varepsilon$. It
is therefore enough to pick $n$ so that $1 + n\varepsilon > a$, or $n > \frac{a - 1}{\varepsilon}$. Thus (A.2.3) holds.
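The choice of $\delta$ in this argument is explicit, so it can be tried numerically. The sketch below follows the recipe $\delta = 1/n$ with $n > (a - 1)/\varepsilon$; the function name `delta_for` and the sample values of $a$ and $\varepsilon$ are assumptions for illustration, and floating-point arithmetic is used throughout.

```python
import math

def delta_for(a, eps):
    """Return delta = 1/n with an integer n > (a - 1)/eps (assumes a > 1, eps > 0)."""
    n = math.floor((a - 1) / eps) + 1
    return 1 / n

a, eps = 2.0, 1e-3
delta = delta_for(a, eps)

# By monotonicity, a**x - 1 < a**delta - 1 for 0 < x < delta.
gap = a ** delta - 1
inside = all(0 <= a ** (delta * t) - 1 < eps for t in (0.1, 0.5, 0.9))
print(delta, gap, inside)
```

Bernoulli's inequality is far from sharp here, so the resulting $\delta$ is smaller than necessary; the proof only needs some admissible $\delta$.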


Left-continuity at the origin is a consequence of

$$\lim_{x \to 0^-} a^{x} = \lim_{x \to 0^-} a^{-(-x)} = \lim_{x \to 0^-} \frac{1}{a^{-x}} = \frac{1}{\lim_{z \to 0^+} a^{z}} = 1,$$

so the exponential map is indeed continuous at the origin. Eventually, from

$$\lim_{x \to x_0} a^{x} = \lim_{x \to x_0} a^{x_0 + (x - x_0)} = a^{x_0} \lim_{x \to x_0} a^{x - x_0} = a^{x_0} \lim_{z \to 0} a^{z} = a^{x_0}$$

we deduce that the function is continuous at every point $x_0 \in \mathbb{R}$.


A.2.3 Napier’s number 


We shall prove some properties of the sequence $a_n = \left(1 + \frac{1}{n}\right)^n$, $n > 0$, defining
Napier's number $e$ (p. 72).


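Property A.2.5 The sequence $\{a_n\}$ is strictly increasing.

Proof. Using Newton's formula (1.13) and (1.11), we may write

$$a_n = \left(1 + \frac{1}{n}\right)^n = \sum_{k=0}^{n} \binom{n}{k} \frac{1}{n^k} = \sum_{k=0}^{n} \frac{1}{k!}\, \frac{n(n - 1) \cdots (n - k + 1)}{n^k}$$

$$= \sum_{k=0}^{n} \frac{1}{k!} \cdot 1 \cdot \left(1 - \frac{1}{n}\right) \cdots \left(1 - \frac{k - 1}{n}\right); \qquad (A.2.4)$$

similarly,

$$a_{n+1} = \sum_{k=0}^{n+1} \frac{1}{k!} \cdot 1 \cdot \left(1 - \frac{1}{n + 1}\right) \cdots \left(1 - \frac{k - 1}{n + 1}\right). \qquad (A.2.5)$$

We note that

$$\left(1 - \frac{1}{n}\right) < \left(1 - \frac{1}{n + 1}\right), \quad \ldots, \quad \left(1 - \frac{k - 1}{n}\right) < \left(1 - \frac{k - 1}{n + 1}\right),$$

so each summand of (A.2.4) is smaller than the corresponding term in (A.2.5).
The latter sum, moreover, contains an additional positive summand labelled by
$k = n + 1$. Therefore $a_n < a_{n+1}$ for each $n$.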


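Property A.2.6 The sequence $\{a_n\}$ is bounded; precisely,

$$2 < a_n < 3, \qquad \forall n > 1.$$

Proof. Since $a_1 = 2$, and the sequence is strictly monotone by the previous property,
we have $a_n > 2$, $\forall n > 1$. Let us show that $a_n < 3$, $\forall n > 1$. By (A.2.4), and
observing that $k! = 1 \cdot 2 \cdot 3 \cdots k \geq 1 \cdot 2 \cdot 2 \cdots 2 = 2^{k-1}$, it follows

$$a_n = \sum_{k=0}^{n} \frac{1}{k!} \left(1 - \frac{1}{n}\right) \cdots \left(1 - \frac{k - 1}{n}\right) < \sum_{k=0}^{n} \frac{1}{k!}$$

$$= 1 + \sum_{k=1}^{n} \frac{1}{k!} \leq 1 + \sum_{k=1}^{n} \frac{1}{2^{k-1}} = 1 + \sum_{k=0}^{n-1} \frac{1}{2^{k}} = 1 + \frac{1 - \frac{1}{2^n}}{1 - \frac{1}{2}} < 3.$$

We conclude that $a_n < 3$.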
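Properties A.2.5 and A.2.6 can be observed on the first terms of the sequence, computed exactly as fractions. This is only a finite check, offered as a sketch; the function name `a` mirrors the notation $a_n$ and the range of indices is an arbitrary choice.

```python
from fractions import Fraction

def a(n):
    """Exact value of a_n = (1 + 1/n)**n for an integer n >= 1."""
    return (1 + Fraction(1, n)) ** n

terms = [a(n) for n in range(1, 60)]

# Strict monotonicity (Property A.2.5) on the computed terms.
increasing = all(x < y for x, y in zip(terms, terms[1:]))

# Bounds 2 < a_n < 3 for n > 1 (Property A.2.6); note that a_1 = 2.
bounded = all(2 < t < 3 for t in terms[1:])
print(increasing, bounded, float(terms[-1]))
```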


A.2.3 Napier’s number 439 


> Proof of Property 4.20, p. 105 


Property 4.20 The following limit holds 


1 xz 
lim (1 + =| =e. 
§0 25S) OG 


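Proof. We start by considering the limit for $x \to +\infty$. Denoting by $n = [x]$ the integer part of $x$ (see Examples 2.1), by definition $n \le x < n+1$; from that it follows $\frac{1}{n+1} < \frac{1}{x} \le \frac{1}{n}$, in other words $1+\frac{1}{n+1} < 1+\frac{1}{x} \le 1+\frac{1}{n}$. The familiar features of power functions yield

$$\left(1+\frac{1}{n+1}\right)^{n} \le \left(1+\frac{1}{n+1}\right)^{x} < \left(1+\frac{1}{x}\right)^{x} \le \left(1+\frac{1}{n}\right)^{x} < \left(1+\frac{1}{n}\right)^{n+1},$$

hence

$$\left(1+\frac{1}{n+1}\right)^{n+1}\left(1+\frac{1}{n+1}\right)^{-1} < \left(1+\frac{1}{x}\right)^{x} < \left(1+\frac{1}{n}\right)^{n}\left(1+\frac{1}{n}\right). \qquad (A.2.6)$$

When $x$ tends to $+\infty$, $n$ does the same. Using (3.3) we have

$$\lim_{n\to+\infty}\left(1+\frac{1}{n}\right)^{n}\left(1+\frac{1}{n}\right) = \lim_{n\to+\infty}\left(1+\frac{1}{n}\right)^{n}\,\lim_{n\to+\infty}\left(1+\frac{1}{n}\right) = e;$$

the substitution $m = n+1$ similarly gives

$$\lim_{n\to+\infty}\left(1+\frac{1}{n+1}\right)^{n+1}\left(1+\frac{1}{n+1}\right)^{-1} = e.$$

Applying the Second comparison theorem 4.5 to the three functions in (A.2.6) proves the claim for $x \to +\infty$. Now let us look at the case when $x$ tends to $-\infty$. If $x < 0$ we can write $x = -|x|$, so

$$\left(1+\frac{1}{x}\right)^{x} = \left(1-\frac{1}{|x|}\right)^{-|x|} = \left(\frac{|x|-1}{|x|}\right)^{-|x|} = \left(\frac{|x|}{|x|-1}\right)^{|x|} = \left(1+\frac{1}{|x|-1}\right)^{|x|}.$$

Set $y = |x| - 1$ and note $y$ tends to $+\infty$ as $x$ goes to $-\infty$. Therefore,

$$\lim_{x\to-\infty}\left(1+\frac{1}{x}\right)^{x} = \lim_{y\to+\infty}\left(1+\frac{1}{y}\right)^{y+1} = \lim_{y\to+\infty}\left(1+\frac{1}{y}\right)^{y}\,\lim_{y\to+\infty}\left(1+\frac{1}{y}\right) = e.$$

This concludes the proof.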
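The two-sided limit of Property 4.20 and the squeeze (A.2.6) can be observed numerically. The sketch below uses ordinary floating-point arithmetic, so it only illustrates the statement; the function name `g` and the sample points are assumptions of this example.

```python
import math

def g(x: float) -> float:
    """The map x -> (1 + 1/x)^x, defined here for x > 0 or x < -1."""
    return (1.0 + 1.0 / x) ** x

# Values for large positive and large negative arguments.
for x in (1e2, 1e4, 1e6):
    print(x, g(x), g(-x))

# The bounds of (A.2.6) at a sample point x, with n the integer part of x.
x = 10.5
n = math.floor(x)
lower = (1.0 + 1.0 / (n + 1)) ** n
upper = (1.0 + 1.0 / n) ** (n + 1)
print(lower, g(x), upper)
```

In the printed table the values for positive arguments stay below `math.e` and those for negative arguments stay above it, in line with the two halves of the proof.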




> Proof of the irrationality of Napier’s number, p. 72 


Property A.2.7 Napier’s number $e$ is irrational, and lies between 2 and 3.


Proof. Based on the First comparison theorem for sequences (p. 137, Theorem 4), from the previous property we quickly deduce

$$2 \le e \le 3. \qquad (A.2.7)$$

Suppose, by contradiction, that $e$ is a rational number, so that there exist two integers $m_0$ and $n_0 \neq 0$ such that $e = \frac{m_0}{n_0}$. Recall that for any $n \ge 0$

$$e = \sum_{k=0}^{n}\frac{1}{k!} + \frac{e^{\bar{x}_n}}{(n+1)!}, \qquad 0 < \bar{x}_n < 1$$

(see Remark 7.4). From this,

$$n!\,e = n!\,\frac{m_0}{n_0} = \sum_{k=0}^{n}\frac{n!}{k!} + \frac{e^{\bar{x}_n}}{n+1}. \qquad (A.2.8)$$

As the exponential map is monotone, and using (A.2.7), we deduce

$$1 = e^0 < e^{\bar{x}_n} < e^1 = e \le 3.$$

Choosing now $n \ge \max(3, n_0)$, the numbers $n!\,\frac{m_0}{n_0}$ and $\sum_{k=0}^{n}\frac{n!}{k!}$ are integers, whereas $\frac{e^{\bar{x}_n}}{n+1}$ lies in the open interval between 0 and 1. The identity (A.2.8) then must be false, so $e$ is irrational and equality in (A.2.7) never occurs. □


A.3 Complements on the global features of continuous maps


We first introduce the concept of subsequence of a given sequence, and establish a
number of related properties. Among them is the Theorem of Bolzano-Weierstrass,
a fundamental ingredient in the subsequent proof of the Theorem of
Weierstrass concerning continuous functions on an interval of the real line; the
proofs of other results for such functions are also provided. The appendix ends with
the definition of uniform continuity and the discussion of some of its properties;
these concepts will find application to the study of integral calculus and differential
equations.


A.3.1 Subsequences 


Theorem 2 on p. 137 states that every converging sequence is bounded. In general,
though, the converse is false. In fact, the sequence $a_n = (-1)^n$ does not
converge despite being bounded ($|a_n| = 1$, $\forall n$). But if we take just the elements
with even subscript, we obtain the constant sequence $\{b_k\}_{k\ge 0}$ where $b_k = a_{2k} = 1$, $k \ge 0$,
which is patently convergent. Similarly if we take odd indexes only:
the constant sequence $\{c_k\}_{k\ge 0}$ with $c_k = a_{2k+1} = -1$, $k \ge 0$, converges. Such
sequences have been extracted, so to say, from the initial sequence $\{a_n\}_{n\ge 0}$, in the
sense formalised below.


Definition A.3.1 Let $\{a_n\}_{n\ge n^*}$ be a sequence and $\{n_k\}_{k\ge 0}$ a strictly increasing sequence of integers $\ge n^*$. The sequence $\{a_{n_k}\}_{k\ge 0}$ is called a subsequence of $\{a_n\}_{n\ge n^*}$.


Observe that the sequence $\{a_{n_k}\}_{k\ge 0}$ is a composite function, for it is obtained
by composing the map $k \mapsto n_k$ with $n \mapsto a_n$.

Any subsequence of a converging or diverging sequence preserves the limit
behaviour of the ‘mother’ sequence:


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_A3, 
© Springer International Publishing Switzerland 2015 




Proposition A.3.2 Let the sequence $\{a_n\}_{n\ge n^*}$ admit limit $\lim_{n\to+\infty} a_n = \ell$, finite or infinite. Then for any subsequence $\{a_{n_k}\}_{k\ge 0}$

$$\lim_{k\to+\infty} a_{n_k} = \ell.$$


Proof. It is not difficult to see, by induction, that

$$n_k \ge k, \qquad \forall k \ge 0. \qquad (A.3.1)$$

Clearly, $n_0 \ge 0$; supposing $n_k \ge k$ we have $n_{k+1} > n_k \ge k$ because the sequence is
strictly increasing. As the $n_k$ are integers, that in turn implies $n_{k+1} \ge k+1$, whence the claim follows.
Due to the First comparison theorem (p. 137, Theorem 4), the inequality
(A.3.1) tells us that the sequence $\{n_k\}$ diverges to $+\infty$. The result then follows from
Theorem 4.15 adapted to sequences (whose proof is similar to the one given on
p. 102).
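Definition A.3.1 views a subsequence as the composition of an index map $k \mapsto n_k$ with the sequence $n \mapsto a_n$. A minimal sketch of that composition for $a_n = (-1)^n$ follows; the helper `subsequence` and the choice of six terms are hypothetical names and values introduced only for this illustration.

```python
def a(n: int) -> int:
    """The bounded, non-convergent sequence a_n = (-1)^n."""
    return (-1) ** n

def subsequence(seq, index_map, count):
    """First `count` terms of the subsequence k -> seq(index_map(k))."""
    return [seq(index_map(k)) for k in range(count)]

even_index = lambda k: 2 * k        # n_k = 2k
odd_index = lambda k: 2 * k + 1     # n_k = 2k + 1

evens = subsequence(a, even_index, 6)   # terms b_k = a_{2k}
odds = subsequence(a, odd_index, 6)     # terms c_k = a_{2k+1}

# Inequality (A.3.1), n_k >= k, checked on the indices used above.
indices_ok = all(even_index(k) >= k and odd_index(k) >= k for k in range(6))
print(evens, odds, indices_ok)
```

Both index maps are strictly increasing, which is what makes the extracted lists genuine subsequences.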


The fact that one can extract a converging subsequence from a bounded sequence,
as we showed with the example $a_n = (-1)^n$, is a general and deep result.
This is how it goes.


Theorem A.3.3 (Bolzano-Weierstrass) A bounded sequence always admits a converging subsequence.


Proof. Suppose $\{x_n\}_{n\ge n^*}$ is a bounded sequence by assuming

$$a \le x_n \le b, \qquad \forall n \ge n^*,$$

for suitable $a, b \in \mathbb{R}$. We shall bisect the interval $[a,b]$ over and over, as in the
proof of Theorem 4.23 of existence of zeroes. Set

$$a_0 = a, \quad b_0 = b, \quad N_0 = \{n \ge n^*\}, \quad n_0 = n^*.$$

Call $c_0$ the midpoint of $[a_0, b_0]$ and define the sets

$$N_0^- = \{n \in N_0 : x_n \in [a_0, c_0]\}, \qquad N_0^+ = \{n \in N_0 : x_n \in [c_0, b_0]\}.$$

Note $N_0 = N_0^- \cup N_0^+$, where at least one of $N_0^-$, $N_0^+$ must be infinite because $N_0$
is. If $N_0^-$ is infinite, set

$$a_1 = a_0, \quad b_1 = c_0, \quad N_1 = N_0^-;$$

otherwise,

$$a_1 = c_0, \quad b_1 = b_0, \quad N_1 = N_0^+.$$

Now let $n_1$ be the first index $> n_0$ contained in $N_1$; we can make such a choice since
$N_1$ is infinite. Iterating the procedure (as always, in these situations, the Principle
of Induction A.1.1 is required in order to make things formal), we can build a
sequence of intervals

$$[a_0, b_0] \supset [a_1, b_1] \supset \ldots \supset [a_k, b_k] \supset \ldots \quad\text{with}\quad b_k - a_k = \frac{b_0 - a_0}{2^k},$$

a sequence of infinite sets

$$N_0 \supseteq N_1 \supseteq \ldots \supseteq N_k \supseteq \ldots$$

and a strictly increasing sequence of indices $\{n_k\}_{k\ge 0}$, $n_k \in N_k$, such that

$$a_k \le x_{n_k} \le b_k, \qquad \forall k \ge 0.$$

Then just as in the proof of Theorem 4.23, there will be a unique $\xi \in [a,b]$ satisfying

$$\lim_{k\to\infty} a_k = \lim_{k\to\infty} b_k = \xi.$$

From the Second comparison theorem (Theorem 5 on p. 137) we deduce that the
sequence $\{x_{n_k}\}_{k\ge 0}$, extracted from $\{x_n\}_{n\ge n^*}$, converges to $\xi$. □


A.3.2 Continuous functions on an interval 


> Proof of the Theorem of Weierstrass, p. 114 


Theorem 4.31 (Weierstrass) A continuous map $f$ on a closed and bounded
interval $[a,b]$ is bounded and admits minimum and maximum

$$m = \min_{x\in[a,b]} f(x) \qquad\text{and}\qquad M = \max_{x\in[a,b]} f(x).$$

Consequently,

$$f([a,b]) = [m, M].$$


Proof. We will show first that $f$ admits a maximum in $[a,b]$, in other words there
exists $\xi \in [a,b]$ such that $f(x) \le f(\xi)$, $\forall x \in [a,b]$. For this, let

$$M = \sup f([a,b]),$$

which is allowed to be either real or $+\infty$. In the former case the characterisation of
the supremum, (1.7) ii), tells us that for any $n \ge 1$ there is $x_n \in [a,b]$ with

$$M - \frac{1}{n} < f(x_n) \le M.$$

Letting $n$ go to $+\infty$, from the Second comparison theorem (Theorem 5 on p. 137)
we infer

$$\lim_{n\to\infty} f(x_n) = M.$$

In the other case, by definition of unbounded set (from above) we deduce that for
any $n \ge 1$ there is $x_n \in [a,b]$ such that

$$f(x_n) \ge n.$$

The Second comparison theorem implies

$$\lim_{n\to\infty} f(x_n) = +\infty = M.$$

In either situation, the sequence $\{x_n\}_{n\ge 1}$ thus defined is bounded (it is contained
in $[a,b]$). We are then entitled to use the Theorem of Bolzano-Weierstrass and call
$\{x_{n_k}\}_{k\ge 0}$ a convergent subsequence. Let $\xi$ be its limit; since all $x_{n_k}$ belong to
$[a,b]$, necessarily $\xi \in [a,b]$. But $\{f(x_{n_k})\}_{k\ge 0}$ is a subsequence of $\{f(x_n)\}_{n\ge 1}$, so
by Proposition A.3.2

$$\lim_{k\to\infty} f(x_{n_k}) = M.$$

The continuity of $f$ at $\xi$ implies

$$f(\xi) = f\left(\lim_{k\to\infty} x_{n_k}\right) = \lim_{k\to\infty} f(x_{n_k}) = M,$$

which tells us that $M$ cannot be $+\infty$. Moreover, $M$ belongs to the range of $f$,
hence

$$M = \max f([a,b]).$$

Arguing in a similar fashion one proves that the number

$$m = \min f([a,b])$$

exists and is finite. The final claim is a consequence of Corollary 4.30.


> Proof of Corollary 4.25, p. 111 


Corollary 4.25 Let f be continuous on the interval I and suppose it admits 
non-zero limits (finite or infinite) that are different in sign for x tending to 


the end-points of I. Then f has a zero in I, which is unique if f is strictly 
monotone on I. 


Proof. We indicate by $\alpha$, $\beta$ (finite or not) the end-points of $I$ and call

$$\lim_{x\to\alpha^+} f(x) = \ell_\alpha \qquad\text{and}\qquad \lim_{x\to\beta^-} f(x) = \ell_\beta.$$

Should one end-point, or both, be infinite, these writings denote the usual limits
at infinity.

We suppose $\ell_\alpha < 0 < \ell_\beta$, for otherwise we can swap the roles of $\ell_\alpha$ and $\ell_\beta$. By
Theorem 4.2 there exist a right neighbourhood $I^+(\alpha)$ of $\alpha$ and a left neighbourhood
$I^-(\beta)$ of $\beta$ such that

$$\forall x \in I^+(\alpha),\ f(x) < 0 \qquad\text{and}\qquad \forall x \in I^-(\beta),\ f(x) > 0.$$

Let us fix points $a \in I^+(\alpha)$ and $b \in I^-(\beta)$ with $\alpha < a < b < \beta$. The interval $[a,b]$ is
contained in $I$, hence $f$ is continuous on it, and by construction $f(a) < 0 < f(b)$.
Therefore $f$ will vanish somewhere, in virtue of Theorem 4.23 (existence of zeroes)
applied to $[a,b]$.

If $f$ is strictly monotone, uniqueness follows from Proposition 2.8 on the interval $I$.


> Proof of Theorems 4.32 and 4.33, p. 114 


Let us prove a preliminary result before proceeding. 


Lemma A.3.4 Let $f$ be continuous and invertible on an interval $I$. For any
chosen points $x_1 < x_2 < x_3$ in $I$, one, and only one, of

(i) $f(x_1) < f(x_2) < f(x_3)$

or

(ii) $f(x_1) > f(x_2) > f(x_3)$

holds.


Proof. As $f$ is invertible, hence one-to-one, the images $f(x_1)$ and $f(x_3)$ cannot
coincide. Then either $f(x_1) < f(x_3)$ or $f(x_1) > f(x_3)$, and we claim that these
cases imply (i) or (ii), respectively.

Suppose $f(x_1) < f(x_3)$, and assume by contradiction that (i) is false, so $f(x_2)$
does not lie strictly between $f(x_1)$ and $f(x_3)$. For instance,

$$f(x_1) < f(x_3) < f(x_2)$$

(if $f(x_2) < f(x_1) < f(x_3)$ the argument is the same). As $f$ is continuous on the
closed interval $[x_1, x_2] \subseteq I$, the Intermediate value theorem 4.29 prescribes that it
will assume every value between $f(x_1)$ and $f(x_2)$ on $[x_1, x_2]$. In particular, there
will be a point $\bar{x} \in (x_1, x_2)$ such that

$$f(\bar{x}) = f(x_3),$$

in contradiction to injectivity: $\bar{x}$ and $x_3$ are in fact distinct, because separated
by $x_2$.




Theorem 4.32 A continuous function $f$ on an interval $I$ is one-to-one if and only if it is strictly monotone.


Proof. Thanks to Proposition 2.8 we only need to prove the implication

$$f \text{ invertible on } I \quad\Longrightarrow\quad f \text{ strictly monotone on } I.$$

Letting $x_1 < x_2$ be arbitrary points of $I$, we claim that if $f(x_1) < f(x_2)$ then $f$
is strictly increasing on $I$ ($f(x_1) > f(x_2)$ will similarly imply $f$ strictly decreases
on $I$).

Let $z_1 < z_2$ be points in $I$, and suppose both lie within $(x_1, x_2)$; the other
possibilities are dealt with in the same way. Hence we have

$$x_1 < z_1 < z_2 < x_2.$$

Let us use Lemma A.3.4 on the triple $x_1, z_1, x_2$: since we have assumed $f(x_1) < f(x_2)$, it follows

$$f(x_1) < f(z_1) < f(x_2).$$

Now we employ the triple $z_1, z_2, x_2$, to the effect that

$$f(z_1) < f(z_2) < f(x_2).$$

The first inequality in the above line tells us $f$ is strictly increasing, proving Theorem 4.32.


Theorem 4.33 Let $f$ be continuous and invertible on an interval $I$. Then the inverse $f^{-1}$ is continuous on the interval $J = f(I)$.


Proof. The first remark is that $J$ is indeed an interval, by Corollary 4.30. Using
Theorem 4.32 we deduce $f$ is strictly monotone on $I$: to fix ideas, suppose it is
strictly increasing (having $f$ strictly decreasing would not change the proof). By
definition of a monotone map we have that $f^{-1}$ is strictly increasing on $J$ as well.
But it is known that a monotone map admits at most discontinuities of the first
kind (Corollary 3.28). We will show that $f^{-1}$ cannot have this type either. By
contradiction, suppose there is a jump point $y_0 = f(x_0) \in J = f(I)$ for $f^{-1}$.
Equivalently, let

$$z_0^- = \sup_{y<y_0} f^{-1}(y) = \lim_{y\to y_0^-} f^{-1}(y), \qquad z_0^+ = \inf_{y>y_0} f^{-1}(y) = \lim_{y\to y_0^+} f^{-1}(y)$$

and suppose $z_0^- < z_0^+$. Then inside $(z_0^-, z_0^+)$ there will be at most one element
$x_0 = f^{-1}(y_0)$ of the range $f^{-1}(J)$. Thus $f^{-1}(J)$ is not an interval. On the other hand,
by definition of $J$ we have $f^{-1}(J) = I$, which is an interval by hypothesis: a contradiction. In conclusion,
$f^{-1}$ must be continuous at each point of $J$.




A.3.3 Uniform continuity 


Let the map $f$ be defined on the real interval $I$. Recall $f$ is called continuous on
$I$ if it is continuous at each point $x_0 \in I$, i.e., for any $x_0 \in I$ and any $\varepsilon > 0$ there
exists $\delta > 0$ such that

$$\forall x \in I, \quad |x - x_0| < \delta \ \Longrightarrow\ |f(x) - f(x_0)| < \varepsilon.$$

In general $\delta = \delta(\varepsilon, x_0)$, meaning that $\delta$ depends on $x_0$, too. But if, for fixed $\varepsilon > 0$,
we find $\delta = \delta(\varepsilon)$ independent of $x_0 \in I$, we say $f$ is uniformly continuous on $I$.
More precisely,


Definition A.3.5 A function $f$ is called uniformly continuous on $I$ if,
for any $\varepsilon > 0$, there is a $\delta > 0$ satisfying

$$\forall x', x'' \in I, \quad |x' - x''| < \delta \ \Longrightarrow\ |f(x') - f(x'')| < \varepsilon. \qquad (A.3.2)$$


Examples A.3.6
i) Let $f(x) = x^2$, defined on $I = [0,1]$. Then

$$|f(x') - f(x'')| = |(x')^2 - (x'')^2| = |x' + x''|\,|x' - x''| \le 2|x' - x''|.$$

If $|x' - x''| < \frac{\varepsilon}{2}$ we see $|f(x') - f(x'')| < \varepsilon$, hence $\delta = \frac{\varepsilon}{2}$ fulfills (A.3.2) on $I$,
rendering $f$ uniformly continuous on $I$.

ii) Take $f(x) = x^2$ on $I = [0,+\infty)$. We want to prove by contradiction that $f$ is
not uniformly continuous on $I$. If it were, with $\varepsilon = 1$ for example, there would
be a $\delta > 0$ satisfying (A.3.2). Choose $x' \in I$ and let $x'' = x' + \frac{\delta}{2}$, so that
$|x' - x''| = \frac{\delta}{2} < \delta$; then

$$|f(x') - f(x'')| = |x' + x''|\,|x' - x''| < 1,$$

or

$$\left(2x' + \frac{\delta}{2}\right)\frac{\delta}{2} < 1.$$

Now letting $x'$ tend to $+\infty$ we obtain a contradiction.

iii) Consider $f(x) = \sin x$ on $I = \mathbb{R}$. From

$$\sin x' - \sin x'' = 2\sin\frac{x' - x''}{2}\,\cos\frac{x' + x''}{2}$$

we have

$$|\sin x' - \sin x''| \le |x' - x''|, \qquad \forall x', x'' \in \mathbb{R}.$$

With a fixed $\varepsilon > 0$, $\delta = \varepsilon$ satisfies the requirement for uniform continuity.




iv) Let $f(x) = \frac{1}{x}$ on $I = (0,+\infty)$. Note that

$$|f(x') - f(x'')| = \left|\frac{1}{x'} - \frac{1}{x''}\right| = \frac{|x' - x''|}{x'x''}.$$

By letting $x'$, $x''$ tend to 0, one easily verifies that $f$ cannot be uniformly
continuous on $I$.
But if we consider only $I_a = [a,+\infty)$, where $a > 0$ is fixed, then

$$|f(x') - f(x'')| \le \frac{|x' - x''|}{a^2},$$

so $\delta = a^2\varepsilon$ satisfies the requirement on $I_a$, for any given $\varepsilon > 0$.
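Examples i) and ii) contrast the same map $f(x) = x^2$ on a bounded and on an unbounded interval. The following numerical sketch illustrates the difference; the grid size, the value of `delta` and the sample points are arbitrary choices made for this illustration, and floating-point arithmetic is used throughout.

```python
def gap_square(x1: float, delta: float) -> float:
    """|f(x') - f(x'')| for f(x) = x^2 with x' = x1 and x'' = x1 + delta/2."""
    x2 = x1 + delta / 2
    return abs(x2 * x2 - x1 * x1)

# Example ii): for a fixed delta the gap grows without bound as x' increases,
# so no single delta can work for epsilon = 1 on [0, +infinity).
delta = 1e-3
gaps = [gap_square(x1, delta) for x1 in (1.0, 1e3, 1e6)]
print(gaps)

# Example i): on [0, 1] the choice delta = eps/2 works for every pair of
# grid points closer than delta.
eps = 0.1
grid = [i / 200 for i in range(201)]
ok_on_unit_interval = all(
    abs(u * u - v * v) < eps
    for u in grid
    for v in grid
    if abs(u - v) < eps / 2
)
print(ok_on_unit_interval)
```

The first list reproduces the quantity $(2x' + \delta/2)\,\delta/2$ from example ii); the second check only samples a finite grid, so it supports the estimate of example i) without replacing it.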


Are there conditions guaranteeing uniform continuity? One answer is provided 
by the following result. 


Theorem A.3.7 (Heine-Cantor) Let $f$ be a continuous map on the closed and bounded interval $I = [a,b]$. Then $f$ is uniformly continuous on $I$.


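Proof. Let us suppose $f$ is not uniformly continuous on $I$. This means that there
exists an $\varepsilon > 0$ such that, for any $\delta > 0$, there are $x', x'' \in I$ with $|x' - x''| < \delta$
and $|f(x') - f(x'')| \ge \varepsilon$. Choosing $\delta = \frac{1}{n}$, $n \ge 1$, we find two sequences of points
$\{x'_n\}_{n\ge 1}$, $\{x''_n\}_{n\ge 1}$ inside $I$ such that

$$|x'_n - x''_n| < \frac{1}{n} \qquad\text{and}\qquad |f(x'_n) - f(x''_n)| \ge \varepsilon.$$

The condition on the left implies

$$\lim_{n\to\infty}(x'_n - x''_n) = 0,$$

while the one on the right implies that $f(x'_n) - f(x''_n)$ will not tend to 0 for $n \to \infty$.
On the other hand the sequence $\{x'_n\}_{n\ge 1}$ is bounded, $a \le x'_n \le b$ for all
$n$, so Theorem A.3.3, of Bolzano-Weierstrass, will give a subsequence $\{x'_{n_k}\}_{k\ge 0}$
converging to a certain $\bar{x} \in I$:

$$\lim_{k\to\infty} x'_{n_k} = \bar{x}.$$

Also the subsequence $\{x''_{n_k}\}_{k\ge 0}$ converges to $\bar{x}$, for

$$\lim_{k\to\infty} x''_{n_k} = \lim_{k\to\infty}\left[x'_{n_k} + (x''_{n_k} - x'_{n_k})\right] = \lim_{k\to\infty} x'_{n_k} + \lim_{k\to\infty}(x''_{n_k} - x'_{n_k}) = \bar{x} + 0 = \bar{x}.$$

Now, $f$ being continuous at $\bar{x}$, we have

$$\lim_{k\to\infty} f(x'_{n_k}) = f\left(\lim_{k\to\infty} x'_{n_k}\right) = f(\bar{x}) \qquad\text{and}\qquad \lim_{k\to\infty} f(x''_{n_k}) = f\left(\lim_{k\to\infty} x''_{n_k}\right) = f(\bar{x}).$$

Then

$$\lim_{k\to\infty}\left(f(x'_{n_k}) - f(x''_{n_k})\right) = f(\bar{x}) - f(\bar{x}) = 0,$$

contradicting the fact that

$$|f(x'_{n_k}) - f(x''_{n_k})| \ge \varepsilon > 0, \qquad \forall k \ge 0.$$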


A.4 Complements on differential calculus


This appendix is entirely devoted to the proof of important results of differential
calculus. We first justify the main derivation formulas, then we prove the Theorem
of de l’Hôpital. Our next topic is the study of differentiable and convex functions,
for which we highlight logical links between convexity and certain properties
of the first derivative. Finally, we establish Taylor formulas with three forms of
the remainder, i.e., Peano’s, Lagrange’s and the integral form.


A.4.1 Derivation formulas 


> Proof of Theorem 6.4, p. 174 


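Theorem 6.4 (Algebraic operations) Let $f(x)$, $g(x)$ be differentiable maps
at $x_0 \in \mathbb{R}$. Then the maps $f(x) \pm g(x)$, $f(x)g(x)$ and, if $g(x_0) \neq 0$, $\frac{f(x)}{g(x)}$ are
differentiable at $x_0$. To be precise,

$$(f \pm g)'(x_0) = f'(x_0) \pm g'(x_0), \qquad (6.3)$$

$$(fg)'(x_0) = f'(x_0)g(x_0) + f(x_0)g'(x_0), \qquad (6.4)$$

$$\left(\frac{f}{g}\right)'(x_0) = \frac{f'(x_0)g(x_0) - f(x_0)g'(x_0)}{[g(x_0)]^2}. \qquad (6.5)$$

Proof. We start from (6.3):

$$\lim_{x\to x_0}\frac{(f(x) \pm g(x)) - (f(x_0) \pm g(x_0))}{x - x_0} = \lim_{x\to x_0}\left(\frac{f(x) - f(x_0)}{x - x_0} \pm \frac{g(x) - g(x_0)}{x - x_0}\right) = f'(x_0) \pm g'(x_0).$$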


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_A4, 
© Springer International Publishing Switzerland 2015 




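Next we prove (6.4). For this, recall that a differentiable map is continuous (Proposition 6.3),
so $\lim_{x\to x_0} g(x) = g(x_0)$. Therefore

$$\lim_{x\to x_0}\frac{f(x)g(x) - f(x_0)g(x_0)}{x - x_0} = \lim_{x\to x_0}\frac{f(x)g(x) - f(x_0)g(x) + f(x_0)g(x) - f(x_0)g(x_0)}{x - x_0}$$

$$= \lim_{x\to x_0}\left(\frac{f(x) - f(x_0)}{x - x_0}\,g(x) + f(x_0)\,\frac{g(x) - g(x_0)}{x - x_0}\right)$$

$$= \lim_{x\to x_0}\frac{f(x) - f(x_0)}{x - x_0}\,\lim_{x\to x_0} g(x) + f(x_0)\lim_{x\to x_0}\frac{g(x) - g(x_0)}{x - x_0} = f'(x_0)g(x_0) + f(x_0)g'(x_0).$$

Finally, we show (6.5). Since $\lim_{x\to x_0} g(x) = g(x_0) \neq 0$, Theorem 4.2 ensures there
is a neighbourhood of $x_0$ where $g(x) \neq 0$. Then the function $\frac{f(x)}{g(x)}$ is well defined
on such neighbourhood and we can consider its difference quotient

$$\lim_{x\to x_0}\frac{\frac{f(x)}{g(x)} - \frac{f(x_0)}{g(x_0)}}{x - x_0} = \lim_{x\to x_0}\frac{f(x)g(x_0) - f(x_0)g(x)}{g(x)g(x_0)(x - x_0)}$$

$$= \lim_{x\to x_0}\frac{f(x)g(x_0) - f(x_0)g(x_0) + f(x_0)g(x_0) - f(x_0)g(x)}{g(x)g(x_0)(x - x_0)}$$

$$= \lim_{x\to x_0}\frac{1}{g(x)g(x_0)}\,\lim_{x\to x_0}\left(\frac{f(x) - f(x_0)}{x - x_0}\,g(x_0) - f(x_0)\,\frac{g(x) - g(x_0)}{x - x_0}\right)$$

$$= \frac{1}{[g(x_0)]^2}\left(f'(x_0)g(x_0) - f(x_0)g'(x_0)\right).$$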


> Proof of Theorem 6.7, p. 175 


Theorem 6.7 (“Chain rule”) Let $f(x)$ be differentiable at $x_0 \in \mathbb{R}$ and $g(y)$
a differentiable map at $y_0 = f(x_0)$. Then the composition $g \circ f(x) = g(f(x))$
is differentiable at $x_0$ and

$$(g \circ f)'(x_0) = g'(y_0)f'(x_0) = g'(f(x_0))f'(x_0).$$


Proof. Let us use the first formula of the finite increment (6.11) on $g$ at the point
$y_0 = f(x_0)$:

$$g(y) - g(y_0) = g'(y_0)(y - y_0) + o(y - y_0), \qquad y \to y_0.$$

By definition of ‘little o’, the above means that there exists a map $\varphi$ such that
$\lim_{y\to y_0}\varphi(y) = 0 = \varphi(y_0)$ satisfying

$$g(y) - g(y_0) = g'(y_0)(y - y_0) + \varphi(y)(y - y_0), \qquad\text{on a neighbourhood } I(y_0) \text{ of } y_0.$$

As $f$ is continuous at $x_0$ (Proposition 6.3), there is a neighbourhood $I(x_0)$ of $x_0$
such that $f(x) \in I(y_0)$ for all $x \in I(x_0)$. If we put $y = f(x)$ in the displayed
relation and divide by $x - x_0 \neq 0$, this becomes

$$\frac{g(f(x)) - g(f(x_0))}{x - x_0} = g'(f(x_0))\,\frac{f(x) - f(x_0)}{x - x_0} + \varphi(f(x))\,\frac{f(x) - f(x_0)}{x - x_0}.$$

Observing that

$$\lim_{x\to x_0}\varphi(f(x)) = \lim_{y\to y_0}\varphi(y) = 0$$

by the Substitution theorem 4.15, we can pass to the limit and conclude

$$\lim_{x\to x_0}\frac{g(f(x)) - g(f(x_0))}{x - x_0} = g'(f(x_0))\lim_{x\to x_0}\frac{f(x) - f(x_0)}{x - x_0} + \lim_{x\to x_0}\varphi(f(x))\,\lim_{x\to x_0}\frac{f(x) - f(x_0)}{x - x_0} = g'(f(x_0))f'(x_0).$$


> Proof of Theorem 6.9, p. 175 


Theorem 6.9 (Derivative of the inverse function) Suppose $f(x)$ is a
continuous, invertible map on a neighbourhood of $x_0 \in \mathbb{R}$, and differentiable
at $x_0$, with $f'(x_0) \neq 0$. Then the inverse map $f^{-1}(y)$ is differentiable at $y_0 = f(x_0)$, and

$$(f^{-1})'(y_0) = \frac{1}{f'(x_0)}.$$


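Proof. The inverse map is continuous on a neighbourhood of $y_0$ in virtue of
Theorem 4.33. Write $x = f^{-1}(y)$, so that on the same neighbourhood

$$\frac{f^{-1}(y) - f^{-1}(y_0)}{y - y_0} = \frac{x - x_0}{f(x) - f(x_0)} = \frac{1}{\frac{f(x) - f(x_0)}{x - x_0}}.$$

By the Substitution theorem 4.15, with $x = f^{-1}(y)$, we have

$$\lim_{y\to y_0}\frac{f^{-1}(y) - f^{-1}(y_0)}{y - y_0} = \lim_{x\to x_0}\frac{1}{\frac{f(x) - f(x_0)}{x - x_0}} = \frac{1}{f'(x_0)}.$$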
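The formula $(f^{-1})'(y_0) = 1/f'(x_0)$ can be illustrated numerically for $f(x) = e^x$, whose inverse is the natural logarithm. This is a rough sketch: it approximates derivatives by central difference quotients (a choice made here, not the one-sided quotient used in the proof), and the step size `h` and the helper name `diff_quotient` are assumptions of the example.

```python
import math

def diff_quotient(func, point: float, h: float = 1e-6) -> float:
    """Central difference quotient approximating func'(point)."""
    return (func(point + h) - func(point - h)) / (2 * h)

x0 = 1.0
y0 = math.exp(x0)                          # y0 = f(x0) with f = exp

lhs = diff_quotient(math.log, y0)          # approximates (f^{-1})'(y0)
rhs = 1.0 / diff_quotient(math.exp, x0)    # approximates 1 / f'(x0)
print(lhs, rhs, 1.0 / math.e)
```

Both printed approximations agree with $1/e$ up to the accuracy of the finite differences.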




A.4.2 De l’Hôpital’s Theorem


> Proof of de l’Hôpital’s Theorem, p. 200


Theorem 6.41 (de l’Hôpital) Let $f, g$ be maps defined on a neighbourhood
of $c$, except possibly at $c$, and such that

$$\lim_{x\to c} f(x) = \lim_{x\to c} g(x) = L,$$

where $L = 0, +\infty$ or $-\infty$. If $f$ and $g$ are differentiable around $c$, except possibly
at $c$, with $g' \neq 0$, and if

$$\lim_{x\to c}\frac{f'(x)}{g'(x)}$$

exists (finite or not), then also

$$\lim_{x\to c}\frac{f(x)}{g(x)}$$

exists and equals the previous limit.


Proof. The theorem includes several statements corresponding to the values as- 
sumed by L and c, and the arguments used for the proofs vary accordingly. For 
this reason we have grouped the proofs together into cases. 


a) The cases $L = 0$, $c = x_0^+, x_0^-, x_0$.

Let us suppose $c = x_0^+$. By assumption $\lim_{x\to x_0^+} f(x) = \lim_{x\to x_0^+} g(x) = 0$, so we may
extend both functions to $x_0$ (re-defining their values if necessary) by putting
$f(x_0) = g(x_0) = 0$; thus $f$ and $g$ become right-continuous at $x_0$. Let $I^+(x_0)$ denote
the right neighbourhood of $x_0$ where $f, g$ satisfy the theorem, and take $x \in I^+(x_0)$.
On the interval $[x_0, x]$ Theorem 6.25 is valid, so there is $t = t(x) \in (x_0, x)$ such
that

$$\frac{f(x)}{g(x)} = \frac{f(x) - f(x_0)}{g(x) - g(x_0)} = \frac{f'(t)}{g'(t)}.$$

As $x_0 < t(x) < x$, the Second comparison theorem 4.5 guarantees that for $x$
tending to $x_0$ also $t = t(x)$ approaches $x_0$. Now the Substitution theorem 4.15
yields

$$\lim_{x\to x_0^+}\frac{f(x)}{g(x)} = \lim_{x\to x_0^+}\frac{f'(t(x))}{g'(t(x))} = \lim_{t\to x_0^+}\frac{f'(t)}{g'(t)},$$

and the proof ends.
We proceed similarly when $c = x_0^-$; the remaining case $c = x_0$ descends from the
two one-sided limits.


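b) The cases $L = 0$, $c = \pm\infty$.

Suppose $c = +\infty$. The substitution $z = \frac{1}{x}$ leads to consider the limit of the quotient
$\frac{f(1/z)}{g(1/z)}$ for $z \to 0^+$. Because $\frac{d}{dz} f\left(\frac{1}{z}\right) = -\frac{1}{z^2} f'\left(\frac{1}{z}\right)$, and similarly for the map $g$,
it follows

$$\lim_{z\to 0^+}\frac{\frac{d}{dz} f\left(\frac{1}{z}\right)}{\frac{d}{dz} g\left(\frac{1}{z}\right)} = \lim_{z\to 0^+}\frac{f'\left(\frac{1}{z}\right)}{g'\left(\frac{1}{z}\right)} = \lim_{x\to+\infty}\frac{f'(x)}{g'(x)}.$$

In this way we return to the previous case $c = 0^+$, and the result is proved. The
same for $c = -\infty$.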


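c) The cases $L = \pm\infty$, $c = x_0^+, x_0^-, x_0$.

Assume $c = x_0^+$ and put $\lim_{x\to x_0^+}\frac{f'(x)}{g'(x)} = \ell$. When $\ell \in \mathbb{R}$, let $I^+(x_0)$ be the right
neighbourhood of $x_0$ on which $f$ and $g$ satisfy the theorem. For every $\varepsilon > 0$ there
exists $\delta_1 > 0$ with $x_0 + \delta_1 \in I^+(x_0)$ so that for all $x \in (x_0, x_0 + \delta_1)$ we have
$\left|\frac{f'(x)}{g'(x)} - \ell\right| < \varepsilon$. On $[x, x_0 + \delta_1]$ Theorem 6.25 holds, hence there is $t = t(x) \in (x, x_0 + \delta_1)$ such that

$$\frac{f(x) - f(x_0 + \delta_1)}{g(x) - g(x_0 + \delta_1)} = \frac{f'(t)}{g'(t)}. \qquad (A.4.1)$$

Write the ratio $\frac{f(x)}{g(x)}$ as

$$\frac{f(x)}{g(x)} = \psi(x)\,\frac{f'(t)}{g'(t)}$$

where, by (A.4.1),

$$\psi(x) = \frac{1 - \frac{g(x_0 + \delta_1)}{g(x)}}{1 - \frac{f(x_0 + \delta_1)}{f(x)}} \qquad\text{with}\qquad \lim_{x\to x_0^+}\psi(x) = 1,$$

because $L = \pm\infty$. The last limit implies that there is a $\delta_2 > 0$, with $\delta_2 < \delta_1$, such
that

$$|\psi(x)| \le 2 \qquad\text{and}\qquad |\psi(x) - 1| < \varepsilon$$

for every $x \in (x_0, x_0 + \delta_2)$. Therefore, for all $x \in (x_0, x_0 + \delta_2)$,

$$\left|\frac{f(x)}{g(x)} - \ell\right| = \left|\psi(x)\frac{f'(t)}{g'(t)} - \psi(x)\ell + \psi(x)\ell - \ell\right| \le |\psi(x)|\left|\frac{f'(t)}{g'(t)} - \ell\right| + |\psi(x) - 1|\,|\ell| < (2 + |\ell|)\varepsilon.$$

We conclude

$$\lim_{x\to x_0^+}\frac{f(x)}{g(x)} = \ell.$$

Let now $\ell = +\infty$; for all $A > 0$ there is $\delta_1 > 0$, with $x_0 + \delta_1 \in I^+(x_0)$, such that
for all $x \in (x_0, x_0 + \delta_1)$ we have $\frac{f'(x)}{g'(x)} > A$. As before, using Theorem A.2.2 we
observe that $\lim_{x\to x_0^+}\psi(x) = 1$ implies the existence of a $\delta_2 > 0$, with $\delta_2 < \delta_1$, such
that $\psi(x) \ge \frac{1}{2}$ for all $x \in (x_0, x_0 + \delta_2)$. Therefore, for every $x \in (x_0, x_0 + \delta_2)$,

$$\frac{f(x)}{g(x)} = \psi(x)\,\frac{f'(t)}{g'(t)} \ge \frac{1}{2}A,$$

proving the claim. The procedure is the same for $\ell = -\infty$.
An analogous proof holds for $c = x_0^-$, and $c = x_0$ is dealt with by putting the
two arguments together.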


d) The cases $L = \pm\infty$, $c = \pm\infty$.


As in b), we may substitute $z = \frac{1}{x}$ and use the previous argument.


A.4.3 Convex functions 


We begin with a lemma, which tells us that local convexity (i.e., convexity on a
neighbourhood of every point of $I$) is in fact a global condition (valid on all of $I$).


Lemma A.4.1 Let $f$ be differentiable on the interval $I$. Then $f$ is convex on
$I$ if and only if for every $x_0 \in I$

$$f(x) \ge f(x_0) + f'(x_0)(x - x_0) \qquad \forall x \in I. \qquad (A.4.2)$$


Proof. Obviously, it is enough to show that if $f$ is convex according to Definition 6.33
on $I$, then also (A.4.2) holds. To this end, one usefully notes that $f$ is
convex on $I$ if and only if the map $g(x) = f(x) + ax + b$, $a, b \in \mathbb{R}$, is convex; in fact,
requiring $f(x) \ge f(x_0) + f'(x_0)(x - x_0)$ is equivalent to $g(x) \ge g(x_0) + g'(x_0)(x - x_0)$.

Let then $x_0 \in I$ be fixed arbitrarily and consider the convex map $g(x) = f(x) - f(x_0) - f'(x_0)(x - x_0)$,
which satisfies $g(x_0) = g'(x_0) = 0$. We have to prove
that $g(x) \ge 0$, $\forall x \in I$. Suppose $x_0$ is not the right end-point of $I$ and let us show
$g(x) \ge 0$, $\forall x \in I$, $x > x_0$; a ‘symmetry’ argument will complete the proof.

Since $g$ is convex at $x_0$, we have $g(x) \ge 0$ on a (right) neighbourhood of $x_0$. It
then makes sense to define

$$P = \{x > x_0 : g(s) \ge 0,\ \forall s \in [x_0, x]\}$$

and $x_1 = \sup P$.

If $x_1$ coincides with the right end-point of $I$, the assertion follows. Let us
assume, by contradiction, $x_1$ lies inside $I$. By definition $g(x) \ge 0$, $\forall x \in [x_0, x_1)$,
while in each (right) neighbourhood of $x_1$ there exist points $x \in I$ at which $g(x) < 0$.
From this and the continuity of $g$ at $x_1$ we deduce that necessarily $g(x_1) = 0$ (so,
in particular, $x_1 = \max P$). We want to prove $g(x) = 0$, $\forall x \in [x_0, x_1]$. Once we
have done that, then $g'(x_1) = 0$ (as $g$ is differentiable at $x_1$ and constant on a left
neighbourhood of the same point). Therefore the convexity of $g$ at $x_1$ implies the
existence of a neighbourhood of $x_1$ where $g(x) \ge 0$, against the definition of $x_1$.

It remains to prove $g(x) = 0$ in $[x_0, x_1]$. As $g(x) \ge 0$ on $[x_0, x_1]$ by definition,
we assume, again by contradiction, that $M = \max\{g(x) : x \in [x_0, x_1]\} > 0$, and
let $\bar{x} \in (x_0, x_1)$ be a pre-image of $g(\bar{x}) = M$. By Fermat’s Theorem 6.21 $g'(\bar{x}) = 0$,
so the convexity at $\bar{x}$ yields a neighbourhood of $\bar{x}$ on which $g(x) \ge g(\bar{x}) = M$;
but $M$ is the maximum of $g$ on $[x_0, x_1]$, so $g(x) = M$ on said neighbourhood. Now
define

$$Q = \{x > \bar{x} : g(s) = M,\ \forall s \in [\bar{x}, x]\}$$

and $x_2 = \sup Q$. The map $g$ is continuous, hence $x_2 = \max Q$, and moreover
$x_2 < x_1$ because $g(x_1) = 0$. As before, the hypothesis of convexity at $x_2$ leads to
a contradiction.


> Proof of Theorem 6.37, p. 193 


Theorem 6.37 Given a differentiable map $f$ on the interval $I$,

a) if $f$ is convex on $I$, then $f'$ is increasing on $I$.

b1) If $f'$ is increasing on $I$, then $f$ is convex on $I$;

b2) if $f'$ is strictly increasing on $I$, then $f$ is strictly convex on $I$.


Proof.
a) Take $x_1 < x_2$ two points in $I$. From (A.4.2) with $x_0 = x_1$ and $x = x_2$ we obtain

$$f'(x_1) \le \frac{f(x_2) - f(x_1)}{x_2 - x_1},$$

while putting $x_0 = x_2$, $x = x_1$ gives

$$\frac{f(x_2) - f(x_1)}{x_2 - x_1} \le f'(x_2).$$

Combining the two inequalities yields the result.

b1) Let $x > x_0$ be chosen in $I$. The second formula of the finite increment of $f$ on
$[x_0, x]$ prescribes the existence of a point $\bar{x} \in (x_0, x)$ such that

$$f(x) = f(x_0) + f'(\bar{x})(x - x_0).$$

The map $f'$ is monotone, so $f'(\bar{x}) \ge f'(x_0)$, hence (A.4.2). When $x < x_0$ the
argument is analogous.

b2) In the proof for b1) we now have $f'(\bar{x}) > f'(x_0)$, whence (A.4.2) is strict (for
$x \neq x_0$).
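Inequality (A.4.2) says that the graph of a convex differentiable map lies above each of its tangent lines. A small numerical sketch for $f(x) = e^x$, whose derivative $e^x$ is strictly increasing, is given below; the sample grid on $[-3, 3]$ and the helper name `tangent_gap` are choices made only for this illustration.

```python
import math

def tangent_gap(f, df, x0: float, x: float) -> float:
    """f(x) minus the value at x of the tangent line to f at x0."""
    return f(x) - (f(x0) + df(x0) * (x - x0))

# Sample points in [-3, 3]; exp is its own derivative.
points = [i / 10 for i in range(-30, 31)]

# Smallest gap over all pairs (x0, x) of sample points, cf. (A.4.2).
min_gap = min(
    tangent_gap(math.exp, math.exp, x0, x) for x0 in points for x in points
)

# Smallest gap over pairs with x != x0, cf. the strict case b2).
min_strict_gap = min(
    tangent_gap(math.exp, math.exp, x0, x)
    for x0 in points
    for x in points
    if x != x0
)
print(min_gap, min_strict_gap)
```

The first minimum is attained at $x = x_0$, where the gap vanishes; away from the point of tangency the gap stays positive, as b2) predicts.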




A.4.4 Taylor formulas 


We open this section by describing an interesting property of Taylor expansions.
Observe to this end that if a map $g$ is defined only at one point $x_0$, its Taylor
polynomial of degree 0 can still be defined, by letting $Tg_{0,x_0}(x) = g(x_0)$.


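Lemma A.4.2 Let $f$ be $n$ times differentiable at $x_0$. The derivative of order
$h$, $0 \le h \le n$, of the Taylor polynomial of $f$ of degree $n$ at $x_0$ coincides with
the Taylor polynomial of $f^{(h)}$ of order $n-h$ at $x_0$:

$$D^h\,Tf_{n,x_0}(x) = T(f^{(h)})_{n-h,x_0}(x). \qquad (A.4.3)$$

In particular,

$$D^h\,Tf_{n,x_0}(x_0) = f^{(h)}(x_0), \qquad \forall h = 0, \ldots, n. \qquad (A.4.4)$$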


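Proof. From Example 6.31 i) we know that

$$D^h (x - x_0)^k = \begin{cases} 0 & \text{if } k < h, \\ \dfrac{k!}{(k-h)!}\,(x - x_0)^{k-h} & \text{if } k \ge h. \end{cases}$$

Therefore

$$D^h\,Tf_{n,x_0}(x) = \sum_{k=0}^{n}\frac{f^{(k)}(x_0)}{k!}\,D^h (x - x_0)^k = \sum_{k=h}^{n}\frac{f^{(k)}(x_0)}{(k-h)!}\,(x - x_0)^{k-h}.$$

Note also that

$$f^{(k)}(x_0) = (f^{(h)})^{(k-h)}(x_0),$$

in other words differentiating $k - h$ times the derivative of order $h$ produces the
$k$th derivative. In this way, putting $\ell = k - h$ gives

$$D^h\,Tf_{n,x_0}(x) = \sum_{\ell=0}^{n-h}\frac{(f^{(h)})^{(\ell)}(x_0)}{\ell!}\,(x - x_0)^{\ell} = T(f^{(h)})_{n-h,x_0}(x),$$

which is (A.4.3). Formula (A.4.4) follows by recalling that the Taylor expansion
at a point $x_0$ of a function coincides with the function itself at that point. □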
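Lemma A.4.2 is a statement about coefficients, so it can be checked exactly on a sample. In the sketch below a function is represented only by a list of hypothetical derivative values $f^{(k)}(x_0)$, $k = 0, \ldots, n$ (the numbers are made up for the illustration), and polynomials are stored as coefficient lists in powers of $(x - x_0)$.

```python
from fractions import Fraction
from math import factorial

def taylor_coeffs(derivs):
    """Coefficients c_k = d_k / k! of the Taylor polynomial, given d_k = f^(k)(x0)."""
    return [Fraction(d, factorial(k)) for k, d in enumerate(derivs)]

def differentiate(coeffs, h):
    """Coefficients of the h-th derivative of sum_k coeffs[k] * (x - x0)^k."""
    for _ in range(h):
        coeffs = [k * c for k, c in enumerate(coeffs)][1:]
    return coeffs

derivs = [3, -1, 4, 1, -5, 9]   # hypothetical values f^(k)(x0), k = 0..5
n = len(derivs) - 1

# (A.4.3): D^h of the degree-n polynomial of f equals the degree-(n-h)
# polynomial built from the derivative values of f^(h), i.e. derivs[h:].
checks = [
    differentiate(taylor_coeffs(derivs), h) == taylor_coeffs(derivs[h:])
    for h in range(n + 1)
]

# (A.4.4): the constant term of D^h Tf is f^(h)(x0).
values_at_x0 = [differentiate(taylor_coeffs(derivs), h)[0] for h in range(n + 1)]
print(all(checks), values_at_x0)
```

Shifting the list of derivative values by `h` positions plays the role of passing from $f$ to $f^{(h)}$ in the lemma.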




> Proof of Theorem 7.1, p. 228 


Theorem 7.1 (Taylor formula with Peano’s remainder) Let $n \ge 0$ and
$f$ be $n$ times differentiable at $x_0$. Then the Taylor formula holds

$$f(x) = Tf_{n,x_0}(x) + o((x - x_0)^n), \qquad x \to x_0,$$

where

$$Tf_{n,x_0}(x) = \sum_{k=0}^{n}\frac{1}{k!}\,f^{(k)}(x_0)(x - x_0)^k = f(x_0) + f'(x_0)(x - x_0) + \ldots + \frac{1}{n!}\,f^{(n)}(x_0)(x - x_0)^n.$$


Proof. We need to show that

$$L = \lim_{x\to x_0}\frac{f(x) - Tf_{n,x_0}(x)}{(x - x_0)^n} = 0.$$

The limit is an indeterminate form of type $\frac{0}{0}$; in order to apply de l’Hôpital’s
Theorem 6.41 we are led to consider the limit

$$\lim_{x\to x_0}\frac{f'(x) - (Tf_{n,x_0})'(x)}{n(x - x_0)^{n-1}} = \lim_{x\to x_0}\frac{f'(x) - T(f')_{n-1,x_0}(x)}{n(x - x_0)^{n-1}}$$

(in which Lemma A.4.2, with $h = 1$, was used); note that the other requirements
of 6.41 are fulfilled.

For $n > 1$ we are still in presence of an indeterminate form $\frac{0}{0}$, so repeating
$n - 1$ times the argument above brings us to the limit

$$\lim_{x\to x_0}\frac{f^{(n-1)}(x) - T(f^{(n-1)})_{1,x_0}(x)}{n!\,(x - x_0)} = \lim_{x\to x_0}\frac{f^{(n-1)}(x) - f^{(n-1)}(x_0) - f^{(n)}(x_0)(x - x_0)}{n!\,(x - x_0)}$$

$$= \frac{1}{n!}\lim_{x\to x_0}\left(\frac{f^{(n-1)}(x) - f^{(n-1)}(x_0)}{x - x_0} - f^{(n)}(x_0)\right) = 0$$

by definition of $n$th derivative at $x_0$. This grants the green light to the use of de
l’Hôpital’s Theorem, and $L = 0$. □




> Proof of Theorem 7.2, p. 228 


Theorem 7.2 (Taylor formula with Lagrange’s remainder) Let $n \ge 0$
and $f$ differentiable $n$ times at $x_0$, with continuous $n$th derivative, be given;
suppose $f$ is differentiable $n+1$ times around $x_0$, except possibly at $x_0$. Then
the Taylor formula

$$f(x) = Tf_{n,x_0}(x) + \frac{1}{(n+1)!}\,f^{(n+1)}(\bar{x})(x - x_0)^{n+1}$$

holds, for a suitable $\bar{x}$ between $x_0$ and $x$.


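Proof. Let $\varphi(x) = f(x) - Tf_{n,x_0}(x)$ and $\psi(x) = (x - x_0)^{n+1}$. Using (A.4.4), for
$h = 0, \ldots, n$ we have

$$\varphi^{(h)}(x_0) = 0;$$

moreover, $\psi^{(h)}(x_0) = 0$ and $\psi^{(h)}(x) \neq 0$ for any $x \neq x_0$. Applying Theorem 6.25
to $\varphi$, $\psi$ on the interval $I_0$ between $x_0$ and $x$, we know there is a point $x_1 \in I_0$ such
that

$$\frac{\varphi(x)}{\psi(x)} = \frac{\varphi(x) - \varphi(x_0)}{\psi(x) - \psi(x_0)} = \frac{\varphi'(x_1)}{\psi'(x_1)}.$$

The same recipe used on the maps $\varphi'(x)$, $\psi'(x)$ on the interval $I_1$ between $x_0$, $x_1$
produces a point $x_2 \in I_1 \subset I_0$ satisfying

$$\frac{\varphi'(x_1)}{\psi'(x_1)} = \frac{\varphi'(x_1) - \varphi'(x_0)}{\psi'(x_1) - \psi'(x_0)} = \frac{\varphi''(x_2)}{\psi''(x_2)}.$$

Iterating the argument eventually gives a point $x_{n+1} \in I_0$ such that

$$\frac{\varphi(x)}{\psi(x)} = \frac{\varphi^{(n+1)}(x_{n+1})}{\psi^{(n+1)}(x_{n+1})}.$$

But $\varphi^{(n+1)}(x) = f^{(n+1)}(x)$ and $\psi^{(n+1)}(x) = (n+1)!$; putting $\bar{x} = x_{n+1}$ in the last identity
yields the assertion. □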


> Proof of Theorem 9.44, p. 338 


Theorem 9.44 (Taylor formula with integral remainder) Let $n \ge 0$
be an arbitrary integer, $f$ differentiable $n+1$ times around a point $x_0$, with
continuous derivative of order $n+1$. Then

$$f(x) - Tf_{n,x_0}(x) = \frac{1}{n!}\int_{x_0}^{x} f^{(n+1)}(t)(x - t)^n\,dt.$$




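Proof. We shall use the induction principle (see Appendix A.1). When $n = 0$, the
formula reduces to the identity

$$f(x) - f(x_0) = \int_{x_0}^{x} f'(t)\,dt,$$

established in Corollary 9.42.

Supposing the statement true for a certain $n$, let us prove it for $n+1$. Integrating
by parts and using the hypothesis,

$$\frac{1}{(n+1)!}\int_{x_0}^{x} f^{(n+2)}(t)(x - t)^{n+1}\,dt = \frac{1}{(n+1)!}\left\{\left[f^{(n+1)}(t)(x - t)^{n+1}\right]_{x_0}^{x} + (n+1)\int_{x_0}^{x} f^{(n+1)}(t)(x - t)^{n}\,dt\right\}$$

$$= -\frac{f^{(n+1)}(x_0)}{(n+1)!}\,(x - x_0)^{n+1} + \frac{1}{n!}\int_{x_0}^{x} f^{(n+1)}(t)(x - t)^{n}\,dt$$

$$= -\frac{f^{(n+1)}(x_0)}{(n+1)!}\,(x - x_0)^{n+1} + f(x) - Tf_{n,x_0}(x)$$

$$= f(x) - Tf_{n+1,x_0}(x).$$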


Finally, we provide an example that illustrates how a more accurate piece of
information may be extracted from the integral form of the remainder, as opposed
to the Lagrange form.


Example A.4.3 Consider the MacLaurin expansion of the exponential function
$f(x) = e^x$ with remainder of order 1, both in Lagrange’s form and in integral form.
Assuming $x > 0$, if we use the former form we have for a suitable $\bar{x} \in (0, x)$

$$e^x = 1 + x + \frac{1}{2}\,e^{\bar{x}}x^2; \qquad (A.4.5)$$

whereas with the latter form we obtain

$$e^x = 1 + x + \int_0^x e^t(x - t)\,dt. \qquad (A.4.6)$$

Since the exponential function is strictly increasing, it holds $e^{\bar{x}} < e^x$; hence we
deduce from (A.4.5) that the error due to approximating $e^x$ by the polynomial
$1 + x$ satisfies

$$0 < e^x - (1 + x) < \frac{1}{2}\,x^2 e^x. \qquad (A.4.7)$$

On the other hand, if we look at the integral remainder, we easily check that the
function $g(t) = e^t(x - t)$ under the integral sign admits for $x > 1$ a strict maximum
at $t = x - 1$, where it takes the value $e^{x-1}$. Hence,

$$0 < \int_0^x e^t(x - t)\,dt < \int_0^x e^{x-1}\,dt = e^{x-1}x.$$

Therefore, we deduce from (A.4.6) that

$$0 < e^x - (1 + x) < e^{x-1}x = \frac{1}{e}\,x e^x. \qquad (A.4.8)$$

Since it is trivially seen that $\frac{1}{e}\,x e^x < \frac{1}{2}\,x^2 e^x$ for $x > 1$, we conclude that (A.4.8)
provides a more accurate estimate of the approximation error than (A.4.7) does.
For instance, for $x = 1$ the error is

$$e - (1 + 1) = e - 2 = 0.71828\ldots;$$

inequality (A.4.7) gives the upper bound $0.71828\ldots < \frac{1}{2}e = 1.35914\ldots$, whereas
(A.4.8) gives the bound $0.71828\ldots < \frac{1}{e}e = 1$, which is sharper.
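The comparison between the two bounds in Example A.4.3 is easy to tabulate. The following sketch evaluates the error $e^x - (1 + x)$ together with the right-hand sides of (A.4.7) and (A.4.8) at a few sample points; the function names and the sample values of $x$ are choices made for this illustration.

```python
import math

def error(x: float) -> float:
    """Approximation error e^x - (1 + x)."""
    return math.exp(x) - (1.0 + x)

def lagrange_bound(x: float) -> float:
    """Right-hand side of (A.4.7): x^2 e^x / 2."""
    return 0.5 * x * x * math.exp(x)

def integral_bound(x: float) -> float:
    """Right-hand side of (A.4.8): x e^(x - 1)."""
    return x * math.exp(x - 1.0)

for x in (1.0, 2.0, 5.0):
    print(x, error(x), integral_bound(x), lagrange_bound(x))
```

At each sample point the error lies below the integral bound, which in turn lies below the Lagrange bound, in agreement with the conclusion of the example.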


A.5 Complements on integral calculus


We begin this appendix by checking the convergence of the two sequences that 
enter the definition of the Cauchy integral. We then consider the Riemann integral; 
we justify the integrability of relevant classes of functions, and we establish several 
properties of integrable functions and of the definite integral. We conclude by 
proving a few results concerning improper integrals. 


A.5.1 The Cauchy integral 


> Proof of Theorem 9.20, p. 320 


Theorem 9.20 The sequences $\{s_n\}$ and $\{S_n\}$ are convergent, and their limits coincide.


Proof. We claim that for any $p \ge 1$

$$s_n \le s_{pn}, \qquad S_{pn} \le S_n.$$

In fact, subdividing the interval $I_k$ in $p$ subintervals $I_{ki}$ ($1 \le i \le p$) of equal width
$\Delta x/p$, and letting

$$m_{ki} = \min_{x\in I_{ki}} f(x),$$

it follows $m_k \le m_{ki}$ for each $i$, hence

$$m_k\,\Delta x \le \sum_{i=1}^{p} m_{ki}\,\frac{\Delta x}{p}.$$

Summing over $k$ we obtain $s_n \le s_{pn}$. The second inequality is similar.
Let now $s_n$, $S_m$ be arbitrary sums. Since

$$s_n \le s_{nm} \le S_{nm} \le S_m,$$


C. Canuto, A. Tabacco: Mathematical Analysis I, 2nd Ed., 
UNITEXT - La Matematica per il 3+2 84, DOI 10.1007/978-3-319-12772-9_A5, 
© Springer International Publishing Switzerland 2015 


any lower sum is less than or equal to any upper sum. Define

$$s = \sup_n s_n \quad \text{and} \quad S = \inf_n S_n.$$


We know $s \le S_m$ holds for any $m$, so $s \le S$. We wish to prove that $s = S$, and that such number is indeed the required limit. By the Heine-Cantor Theorem A.3.7 the map $f$ is uniformly continuous: given $\varepsilon > 0$, there is $\delta > 0$ such that if $x', x'' \in [a,b]$ then

$$|x' - x''| < \delta \quad \text{implies} \quad |f(x') - f(x'')| < \varepsilon.$$


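Let $n_\varepsilon$ be an integer such that $\frac{b-a}{n_\varepsilon} < \delta$. Take any $n \ge n_\varepsilon$; in each subinterval $I_k$ of $[a,b]$ of width $\Delta x = \frac{b-a}{n}$ there exist points $\xi_k$ and $\eta_k$ such that

$$f(\xi_k) = m_k = \min_{x \in I_k} f(x) \quad \text{and} \quad f(\eta_k) = M_k = \max_{x \in I_k} f(x).$$

As $|\eta_k - \xi_k| \le \frac{b-a}{n} \le \frac{b-a}{n_\varepsilon} < \delta$, it follows

$$M_k - m_k = f(\eta_k) - f(\xi_k) < \varepsilon.$$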


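Therefore

$$S_n - s_n = \sum_{k=1}^{n} M_k \Delta x - \sum_{k=1}^{n} m_k \Delta x = \sum_{k=1}^{n} (M_k - m_k)\,\Delta x < \varepsilon \sum_{k=1}^{n} \Delta x = \varepsilon (b - a).$$

In other words, given $\varepsilon > 0$ there is an $n_\varepsilon > 0$ so that for all $n \ge n_\varepsilon$ we have $0 \le S_n - s_n < \varepsilon (b - a)$. This implies

$$S - s \le S_n - s_n < \varepsilon (b - a).$$

Letting $\varepsilon$ tend to 0, $S = s$ follows. In addition,

$$S - s_n \le S_n - s_n < \varepsilon (b - a), \quad \forall n \ge n_\varepsilon,$$

that is,

$$\lim_{n \to \infty} s_n = S.$$

The same arguments may be adapted to show $\lim_{n \to \infty} S_n = S$. □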
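
The behaviour of the lower and upper sums can be observed on a concrete example. The sketch below is an illustration only: it assumes an increasing integrand (here $f(x) = x^2$ on $[0,1]$, an arbitrary choice), so that the minimum and maximum on each subinterval are attained at its left and right endpoints; the helper name `cauchy_sums` is hypothetical.

```python
def cauchy_sums(f, a, b, n):
    """Lower and upper sums (s_n, S_n) over n equal subintervals of [a, b].

    Assumption: f is increasing on [a, b], so on each subinterval the
    minimum is f(left endpoint) and the maximum is f(right endpoint).
    """
    dx = (b - a) / n
    lower = sum(f(a + (k - 1) * dx) for k in range(1, n + 1)) * dx
    upper = sum(f(a + k * dx) for k in range(1, n + 1)) * dx
    return lower, upper

def square(x):
    return x * x

# Doubling n refines the partition: lower sums increase, upper sums decrease.
for n in (10, 20, 40, 80):
    s_n, S_n = cauchy_sums(square, 0.0, 1.0, n)
    print(n, s_n, S_n, S_n - s_n)
```

For this integrand the gap $S_n - s_n$ equals $(f(1) - f(0))/n$, so it tends to 0, and both sums squeeze the common limit $1/3$.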


A.5.2 The Riemann integral 


Throughout the section we shall repeatedly use the following result. 




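Lemma A.5.1 Let $f$ be a bounded function on $I = [a,b]$. Then $f$ is integrable if and only if for any $\varepsilon > 0$ there exist two maps $h_\varepsilon \in S_f^+$ and $g_\varepsilon \in S_f^-$ such that

$$\int_I h_\varepsilon - \int_I g_\varepsilon < \varepsilon.$$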


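Proof. According to the definition, $f$ is integrable if and only if

$$\int_I f = \inf\left\{ \int_I h : h \in S_f^+ \right\} = \sup\left\{ \int_I g : g \in S_f^- \right\}.$$

Let then $f$ be integrable. Given $\varepsilon > 0$, by definition of greatest lower bound and least upper bound one can find a map $h_\varepsilon \in S_f^+$ satisfying $\int_I h_\varepsilon - \int_I f < \varepsilon/2$ and, similarly, a function $g_\varepsilon \in S_f^-$ such that $\int_I f - \int_I g_\varepsilon < \varepsilon/2$. Hence

$$\int_I h_\varepsilon - \int_I g_\varepsilon = \int_I h_\varepsilon - \int_I f + \int_I f - \int_I g_\varepsilon < \varepsilon.$$

Vice versa, using Definition 9.26 together with Property 9.27, one has

$$\int_I g_\varepsilon \le \underline{\int_I} f \le \overline{\int_I} f \le \int_I h_\varepsilon,$$

hence

$$\overline{\int_I} f - \underline{\int_I} f \le \int_I h_\varepsilon - \int_I g_\varepsilon < \varepsilon.$$

But $\varepsilon$ is completely arbitrary, so $\overline{\int_I} f = \underline{\int_I} f$. In other words, $f$ is integrable on $[a,b]$. □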


> Proof of Theorem 9.31, p. 327 


Theorem 9.31 Among the class of integrable maps on $[a,b]$ are

a) continuous maps on $[a,b]$;
b) piecewise-continuous maps on $[a,b]$;
c) continuous maps on $(a,b)$ which are bounded on $[a,b]$;
d) monotone functions on $[a,b]$.


Proof. 
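a) The Theorem of Weierstrass tells us that $f$ is bounded over $[a,b]$, and by the Heine-Cantor Theorem A.3.7 $f$ is uniformly continuous on $[a,b]$. Thus for any given $\varepsilon > 0$, there exists a $\delta > 0$ such that if $x', x'' \in [a,b]$ with $|x' - x''| < \delta$ then $|f(x') - f(x'')| < \varepsilon$. Let us consider a partition $\{x_0, x_1, \ldots, x_n\}$ of $[a,b]$ such that each interval $[x_{k-1}, x_k]$ has width $< \delta$ ($k = 1, \ldots, n$). We apply Weierstrass' Theorem 4.31 to each one of them: for every $k = 1, \ldots, n$, there are points $\xi_k, \eta_k \in [x_{k-1}, x_k]$ such that

$$f(\xi_k) = m_k = \min_{x \in [x_{k-1}, x_k]} f(x) \quad \text{and} \quad f(\eta_k) = M_k = \max_{x \in [x_{k-1}, x_k]} f(x).$$

Since $|\eta_k - \xi_k| < \delta$,

$$M_k - m_k = f(\eta_k) - f(\xi_k) < \varepsilon.$$

Let $h_\varepsilon \in S_f^+$ and $g_\varepsilon \in S_f^-$ be defined by

$$h_\varepsilon(x) = \begin{cases} M_k & \text{if } x \in (x_{k-1}, x_k],\ k = 1, \ldots, n, \\ f(a) & \text{if } x = a, \end{cases} \qquad g_\varepsilon(x) = \begin{cases} m_k & \text{if } x \in (x_{k-1}, x_k],\ k = 1, \ldots, n, \\ f(a) & \text{if } x = a. \end{cases}$$

For any $x \in [a,b]$ we have $h_\varepsilon(x) - g_\varepsilon(x) < \varepsilon$, hence

$$\int_I h_\varepsilon - \int_I g_\varepsilon = \int_I (h_\varepsilon - g_\varepsilon) < \int_I \varepsilon = (b - a)\,\varepsilon.$$

Given that $\varepsilon$ is arbitrary, Lemma A.5.1 yields the result.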


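b) Call $\{x_1, x_2, \ldots, x_{n-1}\}$ the discontinuity points of $f$ inside $[a,b]$, with $x_{k-1} < x_k$, and set $x_0 = a$ and $x_n = b$. For $k = 1, \ldots, n$, consider the continuous maps on $[x_{k-1}, x_k]$ defined as follows:

$$f_k(x) = \begin{cases} f(x) & \text{if } x \in (x_{k-1}, x_k), \\ \lim\limits_{x \to x_{k-1}^+} f(x) & \text{if } x = x_{k-1}, \\ \lim\limits_{x \to x_k^-} f(x) & \text{if } x = x_k. \end{cases}$$

Mimicking the proof of part a), given $\varepsilon > 0$ there exist $h_{\varepsilon,k} \in S_{f_k}^+$, $g_{\varepsilon,k} \in S_{f_k}^-$ such that

$$h_{\varepsilon,k}(x) - g_{\varepsilon,k}(x) < \varepsilon, \quad \forall x \in [x_{k-1}, x_k].$$

Define $h_\varepsilon \in S_f^+$ and $g_\varepsilon \in S_f^-$ by

$$h_\varepsilon(x) = \begin{cases} h_{\varepsilon,k}(x) & \text{if } x \in (x_{k-1}, x_k),\ k = 1, \ldots, n, \\ f(x) & \text{if } x = x_k,\ k = 0, \ldots, n, \end{cases} \qquad g_\varepsilon(x) = \begin{cases} g_{\varepsilon,k}(x) & \text{if } x \in (x_{k-1}, x_k),\ k = 1, \ldots, n, \\ f(x) & \text{if } x = x_k,\ k = 0, \ldots, n. \end{cases}$$

For any $x \in [a,b]$ then, $h_\varepsilon(x) - g_\varepsilon(x) < \varepsilon$; as before, Lemma A.5.1 ends the proof.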




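c) Fix $\varepsilon > 0$ so that $I_\varepsilon = [a + \varepsilon, b - \varepsilon] \subset [a,b]$. The map $f$ is continuous on $I_\varepsilon$ and we may find, as in part a), two step functions defined on $I_\varepsilon$, say $\varphi_\varepsilon$ and $\psi_\varepsilon$, such that

$$\varphi_\varepsilon(x) \le f(x) \le \psi_\varepsilon(x) \quad \text{and} \quad \psi_\varepsilon(x) - \varphi_\varepsilon(x) < \varepsilon, \quad \forall x \in I_\varepsilon.$$

Name $M = \sup_{x \in I} f(x)$ and $m = \inf_{x \in I} f(x)$ the supremum and infimum of $f$. Consider the step functions $h_\varepsilon \in S_f^+$, $g_\varepsilon \in S_f^-$ given by

$$h_\varepsilon(x) = \begin{cases} \psi_\varepsilon(x) & \text{if } x \in I_\varepsilon, \\ M & \text{if } x \notin I_\varepsilon, \end{cases} \qquad g_\varepsilon(x) = \begin{cases} \varphi_\varepsilon(x) & \text{if } x \in I_\varepsilon, \\ m & \text{if } x \notin I_\varepsilon. \end{cases}$$

Theorem 9.33 i) implies

$$\int_I h_\varepsilon - \int_I g_\varepsilon = \int_I (h_\varepsilon - g_\varepsilon) = \int_{[a, a+\varepsilon]} (h_\varepsilon - g_\varepsilon) + \int_{I_\varepsilon} (h_\varepsilon - g_\varepsilon) + \int_{[b-\varepsilon, b]} (h_\varepsilon - g_\varepsilon)$$

$$= 2(M - m)\,\varepsilon + \int_{I_\varepsilon} (h_\varepsilon - g_\varepsilon) < 2(M - m)\,\varepsilon + (b - a - 2\varepsilon)\,\varepsilon < \bigl(2(M - m) + b - a\bigr)\,\varepsilon.$$

Now Lemma A.5.1 allows us to conclude.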


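d) Assume $f$ is increasing. (In case $f$ is decreasing, the proof is analogous.) Note first that $f$ is bounded on $[a,b]$, for $f(a) \le f(x) \le f(b)$, $\forall x \in [a,b]$.

Given $\varepsilon > 0$, let $n$ be a natural number such that $n > \frac{b-a}{\varepsilon}$; split the interval into $n$ parts, each $\frac{b-a}{n} < \varepsilon$ wide, and let $\{x_0, x_1, \ldots, x_n\}$ indicate the partition points. Introduce the step maps $h_n \in S_f^+$, $g_n \in S_f^-$ by

$$h_n(x) = \begin{cases} f(x_k) & \text{if } x \in (x_{k-1}, x_k],\ k = 1, \ldots, n, \\ f(a) & \text{if } x = a, \end{cases} \qquad g_n(x) = \begin{cases} f(x_{k-1}) & \text{if } x \in (x_{k-1}, x_k],\ k = 1, \ldots, n, \\ f(a) & \text{if } x = a. \end{cases}$$

Then

$$\int_I h_n - \int_I g_n = \sum_{k=1}^{n} f(x_k)(x_k - x_{k-1}) - \sum_{k=1}^{n} f(x_{k-1})(x_k - x_{k-1}) = \frac{b-a}{n} \sum_{k=1}^{n} \bigl(f(x_k) - f(x_{k-1})\bigr) = \frac{b-a}{n}\bigl(f(b) - f(a)\bigr) \le \varepsilon\,\bigl(f(b) - f(a)\bigr).$$

Once again, the result follows from Lemma A.5.1. □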
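
The construction in part d) also works for monotone maps with jumps, and the telescoping sum is easy to verify on a computer. The following sketch is only an illustration: the helper names `partition_size`, `step_integrals` and `staircase` are hypothetical, and the test function $f(x) = \lfloor 3x \rfloor + x$ on $[0,1]$ is an arbitrary increasing, discontinuous choice.

```python
import math

def partition_size(a, b, eps):
    """A natural number n with n > (b - a) / eps, as required in part d)."""
    return math.floor((b - a) / eps) + 1

def step_integrals(f, a, b, n):
    """Integrals of the lower and upper step maps g_n, h_n for an increasing f."""
    # Nodes of the uniform partition; the last node equals b exactly.
    xs = [a + (b - a) * k / n for k in range(n + 1)]
    upper = sum(f(xs[k]) * (xs[k] - xs[k - 1]) for k in range(1, n + 1))
    lower = sum(f(xs[k - 1]) * (xs[k] - xs[k - 1]) for k in range(1, n + 1))
    return lower, upper

def staircase(x):
    """Increasing map with jump discontinuities (illustrative choice)."""
    return math.floor(3 * x) + x

eps = 0.05
n = partition_size(0.0, 1.0, eps)
lower, upper = step_integrals(staircase, 0.0, 1.0, n)
print(n, lower, upper, upper - lower)
```

The printed gap equals $\frac{b-a}{n}(f(b) - f(a))$ up to rounding, and it is below $\varepsilon\,(f(b) - f(a))$, as the proof predicts.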




> Proof of Proposition 9.32, p. 328 


Proposition 9.32 If $f$ is integrable on $[a,b]$, then

i) $f$ is integrable on any subinterval $[c,d] \subset [a,b]$;
ii) $|f|$ is integrable on $[a,b]$.


Proof. 
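i) If $f$ is a step function the statement is immediate. More generally, let $f$ be integrable over $[a,b]$; for $\varepsilon > 0$, Lemma A.5.1 yields maps $h_\varepsilon \in S_f^+$, $g_\varepsilon \in S_f^-$ such that

$$\int_a^b h_\varepsilon - \int_a^b g_\varepsilon = \int_a^b (h_\varepsilon - g_\varepsilon) < \varepsilon.$$

Since $h_\varepsilon - g_\varepsilon \ge 0$ and $[c,d] \subset [a,b]$,

$$\int_c^d (h_\varepsilon - g_\varepsilon) \le \int_a^b (h_\varepsilon - g_\varepsilon) < \varepsilon,$$

so the result is a consequence of Lemma A.5.1 applied to the function $f$ restricted to $[c,d]$.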


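ii) Recall $|f| = f_+ + f_-$, where $f_+$ and $f_-$ denote the positive and negative parts of $f$ respectively. Thus it is enough to show that $f_+$ and $f_-$ are integrable, for then we can use Theorem 9.33 ii).

Let us prove $f_+$ is integrable. Given $\varepsilon > 0$, by Lemma A.5.1 there exist $h_\varepsilon \in S_f^+$ and $g_\varepsilon \in S_f^-$ such that $\int_I h_\varepsilon - \int_I g_\varepsilon < \varepsilon$. Let $\{x_0, x_1, \ldots, x_n\}$ be a partition of $I = [a,b]$ adapted to both maps $h_\varepsilon$, $g_\varepsilon$. Consider the positive parts $h_{\varepsilon,+}$, $g_{\varepsilon,+}$ of the step functions. Having fixed an interval $I_k = [x_{k-1}, x_k]$, we may examine the three possible occurrences $0 \le g_\varepsilon \le h_\varepsilon$, $g_\varepsilon \le 0 \le h_\varepsilon$ or $g_\varepsilon \le h_\varepsilon \le 0$. It is easy to check

$$g_{\varepsilon,+} \le f_+ \le h_{\varepsilon,+}$$

and

$$\int_{I_k} h_{\varepsilon,+} - \int_{I_k} g_{\varepsilon,+} \le \int_{I_k} h_\varepsilon - \int_{I_k} g_\varepsilon.$$

Consequently, $h_{\varepsilon,+} \in S_{f_+}^+$, $g_{\varepsilon,+} \in S_{f_+}^-$, and

$$\int_I h_{\varepsilon,+} - \int_I g_{\varepsilon,+} = \sum_{k=1}^{n} \left( \int_{I_k} h_{\varepsilon,+} - \int_{I_k} g_{\varepsilon,+} \right) \le \sum_{k=1}^{n} \left( \int_{I_k} h_\varepsilon - \int_{I_k} g_\varepsilon \right) < \varepsilon.$$

Lemma A.5.1 yields then integrability for $f_+$.
A similar proof shows that $f_-$ is integrable as well. □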




> Proof of Theorem 9.33, p. 329 


Theorem 9.33 Let $f$ and $g$ be integrable on a bounded interval $I$ of the real line.

i) (Additivity with respect to the domain of integration) For any $a, b, c \in I$,

$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx.$$

ii) (Linearity) For any $a, b \in I$ and $\alpha, \beta \in \mathbb{R}$,

$$\int_a^b \bigl(\alpha f(x) + \beta g(x)\bigr)\,dx = \alpha \int_a^b f(x)\,dx + \beta \int_a^b g(x)\,dx.$$

iii) (Positivity) Let $a, b \in I$, with $a < b$. If $f \ge 0$ on $[a,b]$ then

$$\int_a^b f(x)\,dx \ge 0.$$

If $f$ is additionally continuous, equality holds if and only if $f$ is the zero map.

iv) (Monotonicity) Let $a, b \in I$, $a < b$. If $f \le g$ in $[a,b]$, then

$$\int_a^b f(x)\,dx \le \int_a^b g(x)\,dx.$$

v) (Upper and lower bounds) Let $a, b \in I$, $a < b$. Then

$$\left| \int_a^b f(x)\,dx \right| \le \int_a^b |f(x)|\,dx.$$


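Proof. We shall directly prove statements i)-v) for generic integrable maps, for the case of step functions is fairly straightforward.

i) We shall suppose $a < c < b$, for the other instances descend from this and (9.18). By Proposition 9.32 i) $f$ is integrable on the intervals $[a,b]$, $[a,c]$, $[c,b]$. Given $\varepsilon > 0$ moreover, let $g_\varepsilon \in S_f^-$, $h_\varepsilon \in S_f^+$ be such that

$$\int_a^b h_\varepsilon - \int_a^b g_\varepsilon < \varepsilon \quad \text{and} \quad \int_a^b g_\varepsilon \le \int_a^b f \le \int_a^b h_\varepsilon.$$

The property holds for step functions, so

$$\int_a^b g_\varepsilon = \int_a^c g_\varepsilon + \int_c^b g_\varepsilon \le \int_a^c f + \int_c^b f \le \int_a^c h_\varepsilon + \int_c^b h_\varepsilon = \int_a^b h_\varepsilon,$$

and hence

$$\left| \int_a^b f - \int_a^c f - \int_c^b f \right| \le \int_a^b h_\varepsilon - \int_a^b g_\varepsilon < \varepsilon.$$

The claim follows because $\varepsilon$ is arbitrary.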

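ii) We split the proof in two, and prove that

a) $\int_a^b \alpha f(x)\,dx = \alpha \int_a^b f(x)\,dx$;
b) $\int_a^b \bigl(f(x) + g(x)\bigr)\,dx = \int_a^b f(x)\,dx + \int_a^b g(x)\,dx$.

We start from a), and assume $a < b$ for simplicity. When $\alpha = 0$ the result is clear, so let $\alpha > 0$. If $g \in S_f^-$, $h \in S_f^+$ then $\alpha g \in S_{\alpha f}^-$ and $\alpha h \in S_{\alpha f}^+$; thus

$$\alpha \int_a^b g(x)\,dx = \int_a^b \alpha g(x)\,dx \le \underline{\int_a^b} \alpha f(x)\,dx \le \overline{\int_a^b} \alpha f(x)\,dx \le \int_a^b \alpha h(x)\,dx = \alpha \int_a^b h(x)\,dx.$$

From $\alpha \int_a^b g(x)\,dx \le \underline{\int_a^b} \alpha f(x)\,dx$, taking the upper bound of the integrals $\int_a^b g(x)\,dx$ as $g$ varies in $S_f^-$, and using the integrability of $f$ on $[a,b]$, we obtain

$$\alpha \int_a^b f(x)\,dx = \alpha\, \underline{\int_a^b} f(x)\,dx \le \underline{\int_a^b} \alpha f(x)\,dx;$$

similarly from $\overline{\int_a^b} \alpha f(x)\,dx \le \alpha \int_a^b h(x)\,dx$ we get

$$\overline{\int_a^b} \alpha f(x)\,dx \le \alpha\, \overline{\int_a^b} f(x)\,dx = \alpha \int_a^b f(x)\,dx,$$

and hence

$$\alpha \int_a^b f(x)\,dx = \int_a^b \alpha f(x)\,dx.$$

When $\alpha < 0$, the proof is the same because $g \in S_f^-$, $h \in S_f^+$ satisfy $\alpha g \in S_{\alpha f}^+$ and $\alpha h \in S_{\alpha f}^-$.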

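Now part b). Take $f_1 \in S_f^-$, $f_2 \in S_f^+$, $g_1 \in S_g^-$, $g_2 \in S_g^+$; then $f_1 + g_1 \in S_{f+g}^-$, $f_2 + g_2 \in S_{f+g}^+$ and

$$\int_a^b f_1(x)\,dx + \int_a^b g_1(x)\,dx = \int_a^b \bigl(f_1(x) + g_1(x)\bigr)\,dx \le \underline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx$$

$$\le \overline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx \le \int_a^b \bigl(f_2(x) + g_2(x)\bigr)\,dx = \int_a^b f_2(x)\,dx + \int_a^b g_2(x)\,dx.$$

Fix $g_1$, $f_2$ and $g_2$, and take the upper bound of the integrals $\int_a^b f_1(x)\,dx$ as $f_1 \in S_f^-$ varies:

$$\int_a^b f(x)\,dx + \int_a^b g_1(x)\,dx \le \underline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx \le \overline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx \le \int_a^b f_2(x)\,dx + \int_a^b g_2(x)\,dx;$$

varying $g_1$ in $S_g^-$ and taking the upper bound of the integrals $\int_a^b g_1(x)\,dx$ we find

$$\int_a^b f(x)\,dx + \int_a^b g(x)\,dx \le \underline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx \le \overline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx \le \int_a^b f_2(x)\,dx + \int_a^b g_2(x)\,dx.$$

Now we may repeat the argument fixing $g_2$ and varying $f_2 \in S_f^+$ first, then varying $g_2 \in S_g^+$, to obtain

$$\int_a^b f(x)\,dx + \int_a^b g(x)\,dx \le \underline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx \le \overline{\int_a^b} \bigl(f(x) + g(x)\bigr)\,dx \le \int_a^b f(x)\,dx + \int_a^b g(x)\,dx.$$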
iii) The zero map $g$ belongs in $S_f^-$ (it is constant), hence

$$0 = \int_a^b g(x)\,dx \le \int_a^b f(x)\,dx.$$

Suppose $f$ continuous; clearly $f(x) = 0$ forces $\int_a^b f(x)\,dx = 0$. We shall prove the opposite implication: $\int_a^b f(x)\,dx = 0$ implies $f(x) = 0$. If, by contradiction, $f(\bar{x}) \ne 0$ for a certain $\bar{x} \in (a,b)$, Theorem A.2.2 would give a neighbourhood $I_\delta(\bar{x}) = (\bar{x} - \delta, \bar{x} + \delta) \subset [a,b]$ and a constant $K_f > 0$ such that $f(x) \ge K_f$ for any $x \in I_\delta(\bar{x})$. The step function

$$g(x) = \begin{cases} K_f & \text{if } x \in I_\delta(\bar{x}), \\ 0 & \text{if } x \notin I_\delta(\bar{x}) \end{cases}$$

would belong to $S_f^-$, and

$$\int_a^b f(x)\,dx \ge \int_a^b g(x)\,dx = 2\delta K_f > 0,$$

a contradiction. Therefore $f(x) = 0$ for all $x \in (a,b)$, and by continuity $f$ must vanish also at the end-points $a$, $b$.


iv) This follows directly from iii) applied to $h(x) = g(x) - f(x) \ge 0$, together with the linearity ii).


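v) Proposition 9.32 ii) says that $|f|$ is integrable over $[a,b]$. But $f = f_+ - f_-$ ($f_+$ and $f_-$ are the positive and negative parts of $f$ respectively), so the linearity proven in part ii) yields

$$\int_a^b f(x)\,dx = \int_a^b f_+(x)\,dx - \int_a^b f_-(x)\,dx.$$

Using the triangle inequality, property iii) ($f_+, f_- \ge 0$) and the relation $|f| = f_+ + f_-$, we eventually have

$$\left| \int_a^b f(x)\,dx \right| \le \left| \int_a^b f_+(x)\,dx \right| + \left| \int_a^b f_-(x)\,dx \right| = \int_a^b f_+(x)\,dx + \int_a^b f_-(x)\,dx$$

$$= \int_a^b \bigl(f_+(x) + f_-(x)\bigr)\,dx = \int_a^b |f(x)|\,dx. \qquad \square$$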

A.5.3 Improper integrals 


> Check of property (10.3), p. 362 


$$\int_1^{+\infty} \frac{\sin x}{x}\,dx \ \text{ converges, but } \ \int_1^{+\infty} \left| \frac{\sin x}{x} \right| dx \ \text{ diverges.}$$

Proof. We explain first why $\int_1^{+\infty} \frac{\sin x}{x}\,dx$ converges. Let us integrate by parts over each interval $[1,a]$ with $a > 1$, by putting $f(x) = \frac{1}{x}$ and $g'(x) = \sin x$; since $f'(x) = -\frac{1}{x^2}$, $g(x) = -\cos x$, it follows

$$\int_1^a \frac{\sin x}{x}\,dx = -\left[ \frac{\cos x}{x} \right]_1^a - \int_1^a \frac{\cos x}{x^2}\,dx;$$

the last integral is known to converge from Example 10.8. Thus the map $\frac{\sin x}{x}$ has a well-defined improper integral over $[1, +\infty)$.

Now let us convince ourselves that $\frac{\sin x}{x}$ is not absolutely integrable on $[1, +\infty)$. Since $|\sin x| \le 1$ for any $x$, we have

$$\left| \frac{\sin x}{x} \right| \ge \frac{\sin^2 x}{x} = \frac{1}{2}\,\frac{1 - \cos 2x}{x}.$$

We claim the integral $\int_1^{+\infty} \frac{1 - \cos 2x}{x}\,dx$ diverges, hence the Comparison test (Theorem 10.5) forces $\int_1^{+\infty} \left| \frac{\sin x}{x} \right| dx$ to diverge as well. In fact,

$$\int_1^{+\infty} \frac{1 - \cos 2x}{x}\,dx = \int_1^{+\infty} \frac{1}{x}\,dx - \int_1^{+\infty} \frac{\cos 2x}{x}\,dx.$$

While the first integral on the right-hand side diverges, the second one converges, as can be proved by the same procedure as above. Therefore $\int_1^{+\infty} \frac{1 - \cos 2x}{x}\,dx$ diverges, and the function cannot be absolutely integrable. □
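
The different behaviour of the two integrals can be observed numerically. The sketch below is an illustration under stated assumptions: it approximates the integrals over $[1, a]$ with a plain composite midpoint rule (a choice of this sketch, with 100 nodes per unit length), and the helper names are hypothetical. A numerical experiment of this kind suggests, but of course does not prove, convergence or divergence.

```python
import math

def midpoint(f, a, b, n):
    """Composite midpoint rule with n equal subintervals on [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def signed_integrand(x):
    return math.sin(x) / x

def absolute_integrand(x):
    return abs(math.sin(x)) / x

def partial_integrals(upper, steps_per_unit=100):
    """Approximations of the integrals of sin(x)/x and |sin(x)/x| over [1, upper]."""
    n = int((upper - 1.0) * steps_per_unit)
    return (midpoint(signed_integrand, 1.0, upper, n),
            midpoint(absolute_integrand, 1.0, upper, n))

for upper in (10.0, 100.0, 1000.0):
    signed_value, absolute_value = partial_integrals(upper)
    print(upper, signed_value, absolute_value)
```

The first column of values settles down as the upper limit grows, while the second keeps increasing, roughly like a multiple of $\log a$, in line with the lower bound by $\frac{1}{2}\,\frac{1 - \cos 2x}{x}$ used in the proof.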


> Proof of Theorem 10.10, p. 363 


Theorem 10.10 (Asymptotic comparison test) Suppose the function $f \in \mathcal{R}_{loc}([a, +\infty))$ is infinitesimal of order $\alpha$, for $x \to +\infty$, with respect to $\varphi(x) = \frac{1}{x}$. Then

i) if $\alpha > 1$, $f \in \mathcal{R}([a, +\infty))$;
ii) if $\alpha \le 1$, $\int_a^{+\infty} f(x)\,dx$ diverges.


Proof. Since $f(x) \asymp \frac{1}{x^\alpha}$ for $x \to +\infty$, we may assume the map $f$ has constant sign for $x$ sufficiently large, for instance when $x > A > 0$. Without loss of generality we may also take $f$ strictly positive, for otherwise we could just change sign. Moreover, for $x \to +\infty$,

$$f(x) \asymp \frac{1}{x^\alpha} \quad \Longrightarrow \quad f(x) = O\!\left(\frac{1}{x^\alpha}\right) \ \text{ and } \ \frac{1}{x^\alpha} = O\bigl(f(x)\bigr);$$

otherwise said, there exist positive constants $c_1$, $c_2$ such that

$$\frac{c_1}{x^\alpha} \le f(x) \le \frac{c_2}{x^\alpha}, \quad \forall x > A.$$

In order to conclude, it suffices to use the Comparison test (Theorem 10.5) jointly with Example 10.4. □


> Proof of Theorem 10.13, p. 364 


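Theorem 10.13 (Integral test) Let $f$ be continuous, positive and decreasing on $[k_0, +\infty)$, for $k_0 \in \mathbb{N}$. Then

$$\sum_{k=k_0+1}^{\infty} f(k) \le \int_{k_0}^{+\infty} f(x)\,dx \le \sum_{k=k_0}^{\infty} f(k).$$

Therefore the integral and the series share the same behaviour:

a) $\int_{k_0}^{+\infty} f(x)\,dx$ converges $\iff$ $\sum_{k=k_0}^{\infty} f(k)$ converges;
b) $\int_{k_0}^{+\infty} f(x)\,dx$ diverges $\iff$ $\sum_{k=k_0}^{\infty} f(k)$ diverges.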


Proof. Since $f$ decreases, for any $k \ge k_0$ we have

$$f(k+1) \le f(x) \le f(k), \quad \forall x \in [k, k+1],$$

and as the integral is monotone,

$$f(k+1) \le \int_k^{k+1} f(x)\,dx \le f(k).$$

Then for all $n \in \mathbb{N}$ with $n > k_0$ we obtain

$$\sum_{k=k_0+1}^{n+1} f(k) \le \int_{k_0}^{n+1} f(x)\,dx \le \sum_{k=k_0}^{n} f(k)$$

(after re-indexing the first series). Passing to the limit for $n \to +\infty$ and recalling $f$ is positive and continuous, we conclude. □
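
The finite chain of inequalities used in the proof can be checked on an example where the integral is known in closed form. The sketch below assumes $f(x) = 1/x^2$ and $k_0 = 1$ (arbitrary choices made for illustration), for which $\int_{k_0}^{n+1} f(x)\,dx = \frac{1}{k_0} - \frac{1}{n+1}$; the function name `integral_test_bounds` is hypothetical.

```python
def inverse_square(x):
    """Continuous, positive, decreasing sample function f(x) = 1/x^2."""
    return 1.0 / (x * x)

def integral_test_bounds(k0, n):
    """Return (sum_{k0+1}^{n+1} f(k), int_{k0}^{n+1} f, sum_{k0}^{n} f(k)) for f = 1/x^2."""
    left_sum = sum(inverse_square(k) for k in range(k0 + 1, n + 2))
    integral = 1.0 / k0 - 1.0 / (n + 1)  # exact antiderivative -1/x
    right_sum = sum(inverse_square(k) for k in range(k0, n + 1))
    return left_sum, integral, right_sum

for n in (5, 50, 500):
    print(n, integral_test_bounds(1, n))
```

In each printed triple the integral sits between the two partial sums, and all three quantities stay bounded as $n$ grows, consistent with the convergence of both the series and the integral.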


Tables and Formulas 


Recurrent formulas 


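$\cos^2 x + \sin^2 x = 1$, $\forall x \in \mathbb{R}$

$\sin x = 0$ if $x = k\pi$, $\forall k \in \mathbb{Z}$; $\cos x = 0$ if $x = \frac{\pi}{2} + k\pi$
$\sin x = 1$ if $x = \frac{\pi}{2} + 2k\pi$; $\cos x = 1$ if $x = 2k\pi$
$\sin x = -1$ if $x = -\frac{\pi}{2} + 2k\pi$; $\cos x = -1$ if $x = \pi + 2k\pi$

$\sin(\alpha \pm \beta) = \sin\alpha \cos\beta \pm \cos\alpha \sin\beta$
$\cos(\alpha \pm \beta) = \cos\alpha \cos\beta \mp \sin\alpha \sin\beta$

$\sin 2x = 2 \sin x \cos x$, $\cos 2x = 2\cos^2 x - 1$

$\sin x - \sin y = 2 \sin\frac{x-y}{2} \cos\frac{x+y}{2}$
$\cos x - \cos y = -2 \sin\frac{x-y}{2} \sin\frac{x+y}{2}$

$\sin(x + \pi) = -\sin x$, $\cos(x + \pi) = -\cos x$
$\sin\left(x + \frac{\pi}{2}\right) = \cos x$, $\cos\left(x + \frac{\pi}{2}\right) = -\sin x$

$a^{x+y} = a^x a^y$, $a^{x-y} = \frac{a^x}{a^y}$, $(a^x)^y = a^{xy}$

$\log_a(xy) = \log_a x + \log_a y$, $\forall x, y > 0$
$\log_a \frac{x}{y} = \log_a x - \log_a y$, $\forall x, y > 0$
$\log_a(x^y) = y \log_a x$, $\forall x > 0$, $\forall y \in \mathbb{R}$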






Fundamental limits 


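$\lim_{x \to +\infty} x^\alpha = +\infty$, $\lim_{x \to 0^+} x^\alpha = 0$, $\alpha > 0$
$\lim_{x \to +\infty} x^\alpha = 0$, $\lim_{x \to 0^+} x^\alpha = +\infty$, $\alpha < 0$

$\lim_{x \to \pm\infty} \dfrac{a_n x^n + \ldots + a_1 x + a_0}{b_m x^m + \ldots + b_1 x + b_0} = \dfrac{a_n}{b_m} \lim_{x \to \pm\infty} x^{n-m}$

$\lim_{x \to +\infty} a^x = +\infty$, $\lim_{x \to -\infty} a^x = 0$, $a > 1$
$\lim_{x \to +\infty} a^x = 0$, $\lim_{x \to -\infty} a^x = +\infty$, $0 < a < 1$

$\lim_{x \to +\infty} \log_a x = +\infty$, $\lim_{x \to 0^+} \log_a x = -\infty$, $a > 1$
$\lim_{x \to +\infty} \log_a x = -\infty$, $\lim_{x \to 0^+} \log_a x = +\infty$, $0 < a < 1$

$\lim_{x \to \pm\infty} \sin x$, $\lim_{x \to \pm\infty} \cos x$, $\lim_{x \to \pm\infty} \tan x$ do not exist

$\lim_{x \to \left(\frac{\pi}{2} + k\pi\right)^{\pm}} \tan x = \mp\infty$, $\forall k \in \mathbb{Z}$; $\lim_{x \to \pm\infty} \arctan x = \pm\frac{\pi}{2}$

$\lim_{x \to \pm 1} \arcsin x = \pm\frac{\pi}{2} = \arcsin(\pm 1)$

$\lim_{x \to +1} \arccos x = 0 = \arccos 1$, $\lim_{x \to -1} \arccos x = \pi = \arccos(-1)$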


in particular, 


in particular, 




Derivatives of elementary functions 


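$(\sin x)' = \cos x$
$(\cos x)' = -\sin x$
$(\tan x)' = 1 + \tan^2 x = \dfrac{1}{\cos^2 x}$
$(\arcsin x)' = \dfrac{1}{\sqrt{1 - x^2}}$
$(\arccos x)' = -\dfrac{1}{\sqrt{1 - x^2}}$
$(\arctan x)' = \dfrac{1}{1 + x^2}$
$(a^x)' = (\log a)\, a^x$
$(\log_a |x|)' = \dfrac{1}{(\log a)\, x}$
$(\sinh x)' = \cosh x$
$(\cosh x)' = \sinh x$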

Differentiation rules 




Maclaurin’s expansions 


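$e^x = 1 + x + \dfrac{x^2}{2} + \ldots + \dfrac{x^n}{n!} + o(x^n)$

$\log(1 + x) = x - \dfrac{x^2}{2} + \ldots + (-1)^{n-1} \dfrac{x^n}{n} + o(x^n)$

$\sin x = x - \dfrac{x^3}{3!} + \dfrac{x^5}{5!} - \ldots + (-1)^m \dfrac{x^{2m+1}}{(2m+1)!} + o(x^{2m+2})$

$\cos x = 1 - \dfrac{x^2}{2} + \dfrac{x^4}{4!} - \ldots + (-1)^m \dfrac{x^{2m}}{(2m)!} + o(x^{2m+1})$

$\sinh x = x + \dfrac{x^3}{3!} + \dfrac{x^5}{5!} + \ldots + \dfrac{x^{2m+1}}{(2m+1)!} + o(x^{2m+2})$

$\cosh x = 1 + \dfrac{x^2}{2} + \dfrac{x^4}{4!} + \ldots + \dfrac{x^{2m}}{(2m)!} + o(x^{2m+1})$

$\arcsin x = x + \dfrac{x^3}{6} + \dfrac{3x^5}{40} + \ldots + \left| \dbinom{-\frac{1}{2}}{m} \right| \dfrac{x^{2m+1}}{2m+1} + o(x^{2m+2})$

$\arctan x = x - \dfrac{x^3}{3} + \dfrac{x^5}{5} - \ldots + (-1)^m \dfrac{x^{2m+1}}{2m+1} + o(x^{2m+2})$

$(1 + x)^\alpha = 1 + \alpha x + \dfrac{\alpha(\alpha - 1)}{2} x^2 + \ldots + \dbinom{\alpha}{n} x^n + o(x^n)$

$\dfrac{1}{1 + x} = 1 - x + x^2 - \ldots + (-1)^n x^n + o(x^n)$

$\sqrt{1 + x} = 1 + \dfrac{1}{2} x - \dfrac{1}{8} x^2 + \dfrac{1}{16} x^3 + o(x^3)$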




Integrals of elementary functions 


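$\int \sin x\,dx = -\cos x + c$
$\int \cos x\,dx = \sin x + c$
$\int e^x\,dx = e^x + c$
$\int \sinh x\,dx = \cosh x + c$
$\int \cosh x\,dx = \sinh x + c$
$\int \dfrac{1}{1 + x^2}\,dx = \arctan x + c$
$\int \dfrac{1}{\sqrt{1 - x^2}}\,dx = \arcsin x + c$
$\int \dfrac{1}{\sqrt{x^2 + 1}}\,dx = \log\bigl(x + \sqrt{x^2 + 1}\bigr) + c = \operatorname{sett\,sinh} x + c$
$\int \dfrac{1}{\sqrt{x^2 - 1}}\,dx = \log\bigl(x + \sqrt{x^2 - 1}\bigr) + c = \operatorname{sett\,cosh} x + c$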


Index 


Absolute value, 13 
Antiderivative, 302 
Arc, 282 

closed, 282 

Jordan, 282 

length, 377 

simple, 282 
Arccosine, 56, 114 
Archimedean property, 16 
Arcsine, 56, 114, 176, 336 
Arctangent, 114, 176, 336 
Argument, 275 
Asymptote, 135 

horizontal, 135 

oblique, 135 

vertical, 136 


Binomial 
coefficient, 19, 234 
expansion, 20, 428 
Bisection method, 111 


Cardinality, 2 
Colatitude, 261 
Combination, 20 
Conjunction, 5 
Connective, 5 
Coordinates 
cylindrical, 261 
polar, 259 
spherical, 261 
Corner, 178 


Cosine, 52, 101, 173, 176, 232 


hyperbolic, 198, 237 


Cotangent, 54 
Curve, 281 


congruent, 372 
equivalent, 372 
integral, 390 
opposite, 372 
piecewise regular, 284 
plane, 281 

regular, 284 

simple, 282 


De Morgan laws, 4 
Degree, 50, 52 
Derivative, 170, 190 


backward, 178 
forward, 178 
left, 178 
logarithmic, 176 
of order k, 190 
partial, 288, 290 
right, 178 
second, 190 


Difference, 4 


quotient, 169 
symmetric, 4 


Differential equation 


autonomous, 390 


homogeneous, 396, 399, 406 


linear, 396, 406 
ordinary, 389 
solution, 389 


with separable variables, 394 






Discontinuity 
of first kind, 84 
of second kind, 84 
removable, 78 
Disjunction, 5 
Domain, 31 


Equation 
characteristic, 407 
Equivalence 
logic, 6 
Expansion 
asymptotic, 243 
Maclaurin, 229, 235 
Taylor, 228 
Exponential, 50, 173, 229 


Factorial, 18 
Form 
algebraic, 272 
Cartesian, 272 
exponential, 276 
indeterminate, 99, 107 
normal, 390 
polar, 275 
trigonometric, 275 
Formula, 5 
addition, 54 
contrapositive, 6 
De Moivre, 277 
duplication, 54 
Euler, 276 
finite increment, 186 
Stirling, 141 
subtraction, 54 
Taylor, 228, 456, 457 
Function, 31 
absolute value, 33, 34 
absolutely integrable, 362 
arccosine, 56, 114 
arcsine, 56, 114, 176, 336 
arctangent, 114, 176, 336 
asymptotic, 136 
big o, 123 
bijective, 40 
bounded, 37, 95 
bounded from above, 37 
composite, 103, 175, 241 
composition, 43 


concave, 192 
continuous, 76, 80, 287 
continuous on the right, 83 
convex, 192 
cosine, 52, 101, 173, 176, 232 
cotangent, 54 
decreasing, 42 
differentiable, 170, 190 
equivalent, 124 
even, 47, 177, 229 
exponential, 50, 173, 229 
hyperbolic, 198 
hyperbolic cosine, 198, 237 
hyperbolic sine, 198, 237 
hyperbolic tangent, 199 
increasing, 41 
infinite, 130 
infinite of bigger order, 131 
infinite of same order, 131 
infinite of smaller order, 131 
infinitesimal, 130, 244 
injective, 38 
integer part, 33, 34 
integrable, 326 
integral, 333 
inverse, 38, 114, 175 
cosine, 56 
hyperbolic tangent, 200 
hyperbolic cosine, 200 
hyperbolic sine, 199 
sine, 55 
tangent, 56 
invertible, 39 
little o, 124 
logarithm, 51, 114, 176, 231 
mantissa, 34 
monotone, 41, 84, 114, 188 
negative part, 361 
negligible, 124 
odd, 47, 177, 229 
of class $C^\infty$, 191
of class $C^k$, 191
of real variable, 32 


of same order of magnitude, 124 


of several variables, 286 
one-to-one, 38, 114 
onto, 38 

periodic, 47 

piecewise, 32 


piecewise-continuous, 319 
polynomial, 50, 98, 100, 174, 315 
positive part, 361 

power, 48, 234 

primitive, 302 

rational, 50, 98, 100, 101, 312 
real, 32 

real-valued, 32 

sign, 33, 34

sine, 52, 79, 93, 106, 173, 232 
step, 323 

surjective, 38 

tangent, 54, 175, 240 
trigonometric, 51 

uniformly continuous, 447 


Gap, 84 
Gradient, 288 
Graph, 31 


Image, 31, 36 
of a curve, 281 
Implication, 5 
Inequality 
Bernoulli, 139, 427 
Cauchy-Schwarz, 266 
triangle, 13 
Infimum 
of a function, 37 
of a set, 17 
Infinite, 203 
of bigger order, 131 
of same order, 131 
of smaller order, 131 
test function, 131 
Infinitesimal, 130, 203 
of bigger order, 130 
of same order, 130 
of smaller order, 130 
test function, 131 
Inflection, 193, 247 
ascending, 193 
descending, 193 
Integral 
Cauchy, 320 
definite, 319, 321, 323, 326 
general, 391 
improper, 358, 365, 369 
indefinite, 302, 303 




line, 370, 378 
lower, 325 
mean value, 330 
particular, 391 
Riemann, 322 
singular, 394 
upper, 325 
Integration 
by parts, 307, 338 
by substitution, 309, 317, 338 
Intersection, 3, 7 
Interval, 14 
of monotonicity, 42, 188 


Inverse 
cosine, 56, 114 
sine, 55, 114 


tangent, 56, 114 


Landau symbols, 123 
Latitude, 261 
Length 
of a curve, 375, 376 
of a vector, 263 
Limit, 68, 70, 73, 76, 81 
left, 82 
right, 82 
Logarithm, 51, 106, 114, 176, 231 
natural, 72 
Longitude, 261 
Lower bound, 15 
greatest, 17, 113 


Map, 31 

identity, 45 
Maximum, 16, 37 
absolute, 180 
relative, 180 
Minimum, 16, 37 
Modulus, 274 


Negation, 5 
Neighbourhood, 65, 287 
left, 82 
right, 82 
Norm 
of a vector, 263 
Number 
complex, 272 
integer, 9 




Napier, 72, 106, 173, 437 
natural, 9 

rational, 9 

real, 10 


Order, 244 
of a differential equation, 389 
of an infinite function, 132 


of an infinitesimal function, 132 


of magnitude, 203 


Pair 
ordered, 21 
Part 
imaginary, 272 
negative, 361 
positive, 361 
principal, 133, 244 
real, 272 
Partition, 322 
adapted, 323 
Period, 10, 47 
minimum, 48 
Permutation, 19 
Point 
corner, 178 
critical, 181, 245 
cusp, 179 
extremum, 180 
inflection, 193, 247 
interior, 15 
jump, 84 
Lagrange, 184 
maximum, 180 
minimum, 180 
of discontinuity, 84 
with vertical tangent, 179 
Polynomial, 50, 98, 100, 174, 315 
characteristic, 407 
Taylor, 228 
Pre-image, 36 
Predicate, 2, 6 
Primitive, 302 
Principle of Induction, 427 
Problem 
boundary value, 393 
Cauchy, 392 
initial value, 392 


Product 

Cartesian, 21 

dot, 266 

scalar, 266 
Prolongation, 78 
Proof by contradiction, 6 


Quantifier 
existential, 7 
universal, 7 


Radian, 52 
Radius, 65 
Range, 31, 36 
Refinement, 322 
Region 
under the curve, 319 
Relation, 23 
Remainder 
integral, 338, 458 
Lagrange, 227, 229, 458 
of a series, 145 
Peano, 227, 228, 457 
Restriction, 40 


Sequence, 32, 66, 104, 137 
convergent, 68 
divergent, 70 
geometric, 138 
indeterminate, 71 
monotone, 71 
of partial sums, 142 
subsequence, 441 

Series, 141 
absolutely convergent, 152 
alternating, 151 
conditionally converging, 153 
converging, 142 
diverging, 142 
general term, 142 
geometric, 146 
harmonic, 148, 152, 364 
indeterminate, 142 
Mengoli, 144 
positive-term, 146 
telescopic, 145 


Set, 1 
ambient, 1 
bounded, 15 


bounded from above, 15 
bounded from below, 15 
complement, 3, 7 
empty, 2 

power, 2 


Sine, 52, 79, 93, 106, 173, 232 


hyperbolic, 198, 237 


Subsequence, 441 
Subset, 1, 7 

Sum of a series, 142 
Supremum 


of a function, 37 
of a set, 17 


Tangent, 54, 171, 175, 240 
Test 


absolute convergence, 153, 361 

asymptotic comparison, 148, 363, 367, 471

comparison, 147, 360, 367 

integral, 364, 472 

Leibniz, 151 

ratio, 139, 149 

root, 150 


Theorem 


Bolzano-Weierstrass, 442 

Cauchy, 185 

comparison, 92, 95, 137 

de l'Hôpital, 200, 452

existence of zeroes, 109, 429 

Fermat, 181 

Fundamental of integral calculus, 333 
Heine-Cantor, 448 

intermediate value, 112 

Lagrange, 184 




local boundedness, 431 
Mean Value, 184 
Mean Value of integral calculus, 331 
Rolle, 183 
substitution, 102, 138 
uniqueness of the limit, 89 
Weierstrass, 114, 443 
Translation, 45 


Union, 3, 7 

Unit circle, 51 

Upper bound, 15 
least, 17, 113 


Value 
maximum, 37 
principal, 276 
Variable 
dependent, 36, 169 
independent, 36, 169 
Vector, 262 
at a point, 270 
direction, 263 
field, 378 
length, 263
orientation, 263 
orthogonal, 266 
perpendicular, 266 
position, 262 
space, 264 
tangent, 284 
unit, 265 
Venn diagrams, 2 


Zero, 108 




Collana Unitext - La Matematica per il 3+2 


Series Editors: 

A. Quarteroni (Editor-in-Chief) 
L. Ambrosio 

P. Biscari 

C. Ciliberto 

M. Ledoux 

W.J. Runggaldier


Editor at Springer: 
F. Bonadei 
francesca.bonadei@springer.com 


As of 2004, the books published in the series have been given a volume num- 
ber. Titles in grey indicate editions out of print. 
As of 2011, the series also publishes books in English. 


A. Bernasconi, B. Codenotti 
Introduzione alla complessita computazionale 
1998, X+260 pp, ISBN 88-470-0020-3 


A. Bernasconi, B. Codenotti, G. Resta 
Metodi matematici in complessita computazionale 
1999, X+364 pp, ISBN 88-470-0060-2 


E. Salinelli, E Tomarelli 
Modelli dinamici discreti 
2002, XII+354 pp, ISBN 88-470-0187-0 


S. Bosch 
Algebra 
2003, VIII+380 pp, ISBN 88-470-0221-4 


S. Graffi, M. Degli Esposti 
Fisica matematica discreta 
2003, X+248 pp, ISBN 88-470-0212-5 


S. Margarita, E. Salinelli 
MultiMath - Matematica Multimediale per Universita 
2004, XX+270 pp, ISBN 88-470-0228-1 


A. Quarteroni, R. Sacco, ESaleri 
Matematica numerica (2a Ed.) 

2000, XIV+448 pp, ISBN 88-470-0077-7 
2002, 2004 ristampa riveduta e corretta 
(1a edizione 1998, ISBN 88-470-0010-6) 


13. A. Quarteroni, F. Saleri 
Introduzione al Calcolo Scientifico (2a Ed.) 
2004, X+262 pp, ISBN 88-470-0256-7 
(1a edizione 2002, ISBN 88-470-0149-8) 


14. S. Salsa 
Equazioni a derivate parziali - Metodi, modelli e applicazioni 
2004, XII+426 pp, ISBN 88-470-0259-1 


15. G. Riccardi 
Calcolo differenziale ed integrale 
2004, XII+314 pp, ISBN 88-470-0285-0 


16. M. Impedovo 
Matematica generale con il calcolatore 
2005, X+526 pp, ISBN 88-470-0258-3 


17. L. Formaggia, FE. Saleri, A. Veneziani 
Applicazioni ed esercizi di modellistica numerica 
per problemi differenziali 
2005, VIII+396 pp, ISBN 88-470-0257-5 


18. S. Salsa, G. Verzini 
Equazioni a derivate parziali - Complementi ed esercizi 
2005, VIII+406 pp, ISBN 88-470-0260-5 
2007, ristampa con modifiche 


19. C. Canuto, A. Tabacco 
Analisi Matematica I (2a Ed.) 
2005, XII+448 pp, ISBN 88-470-0337-7 
(1a edizione, 2003, XII+376 pp, ISBN 88-470-0220-6) 


20. F. Biagini, M. Campanino 
Elementi di Probabilita e Statistica 
2006, XII+236 pp, ISBN 88-470-0330-X 


Ze, 


23; 


24. 


DD 


26. 


Zi 


28. 


29, 


30. 


S. Leonesi, C. Toffalori 
Numeri e Crittografia 
2006, VIII+178 pp, ISBN 88-470-0331-8 


A. Quarteroni, F. Saleri 
Introduzione al Calcolo Scientifico (3a Ed.) 
2006, X+306 pp, ISBN 88-470-0480-2 


S. Leonesi, C. Toffalori 
Un invito all’Algebra 
2006, XVII+432 pp, ISBN 88-470-0313-X 


W.M. Baldoni, C. Ciliberto, G.M. Piacentini Cattaneo 
Aritmetica, Crittografia e Codici 
2006, XVI+518 pp, ISBN 88-470-0455-1 


A. Quarteroni 

Modellistica numerica per problemi differenziali (3a Ed.) 
2006, XIV +452 pp, ISBN 88-470-0493-4 

(la edizione 2000, ISBN 88-470-0108-0) 

(2a edizione 2003, ISBN 88-470-0203-6) 


M. Abate, E Tovena 
Curve e superfici 
2006, XIV+394 pp, ISBN 88-470-0535-3 


L. Giuzzi 
Codici correttori 
2006, XVI+402 pp, ISBN 88-470-0539-6 


L. Robbiano 
Algebra lineare 
2007, XVI+210 pp, ISBN 88-470-0446-2 


E. Rosazza Gianin, C. Sgarra 
Esercizi di finanza matematica 
2007, X+184 pp, ISBN 978-88-470-0610-2 


A. Machi 

Gruppi - Una introduzione a idee e metodi della Teoria dei Gruppi 
2007, XII+350 pp, ISBN 978-88-470-0622-5 

2010, ristampa con modifiche 


al 


a2, 


33. 


34, 


35. 


36. 


OZ. 


38. 


a. 


40. 


4l. 


Y. Biollay, A. Chaabouni, J. Stubbe 
Matematica si parte! 

A cura di A. Quarteroni 

2007, XII+196 pp, ISBN 978-88-470-0675-1 


M. Manetti 
Topologia 
2008, XII+298 pp, ISBN 978-88-470-0756-7 


A. Pascucci 
Calcolo stocastico per la finanza 
2008, XVI+518 pp, ISBN 978-88-470-0600-3 


A. Quarteroni, R. Sacco, F. Saleri 
Matematica numerica (3a Ed.) 
2008, XVI+510 pp, ISBN 978-88-470-0782-6 


P. Cannarsa, T. D’Aprile 
Introduzione alla teoria della misura e all’analisi funzionale 
2008, XII+268 pp, ISBN 978-88-470-0701-7 


A. Quarteroni, F. Saleri 
Calcolo scientifico (4a Ed.) 
2008, XIV +358 pp, ISBN 978-88-470-0837-3 


C. Canuto, A. Tabacco 
Analisi Matematica I (3a Ed.) 
2008, XIV +452 pp, ISBN 978-88-470-0871-3 


S. Gabelli 
Teoria delle Equazioni e Teoria di Galois 
2008, XVI+410 pp, ISBN 978-88-470-0618-8 


A. Quarteroni 
Modellistica numerica per problemi differenziali (4a Ed.) 
2008, XVI+560 pp, ISBN 978-88-470-0841-0 


C. Canuto, A. Tabacco 

Analisi Matematica II 

2008, XVI+536 pp, ISBN 978-88-470-0873-1 
2010, ristampa con modifiche 


E. Salinelli, F Tomarelli 
Modelli Dinamici Discreti (2a Ed.) 
2009, XIV +382 pp, ISBN 978-88-470-1075-8 


43. 


44, 


45. 


46. 


47. 


48. 


49. 


50. 


51. 


a2; 


S. Salsa, EM.G. Vegni, A. Zaretti, P. Zunino 
Invito alle equazioni a derivate parziali 
2009, XIV +440 pp, ISBN 978-88-470-1179-3 


S. Dulli, S. Furini, E. Peron 
Data mining 
2009, XIV+178 pp, ISBN 978-88-470-1162-5 


A. Pascucci, W.J. Runggaldier 
Finanza Matematica 
2009, X+264 pp, ISBN 978-88-470-1441-1 


S. Salsa 


Equazioni a derivate parziali - Metodi, modelli e applicazioni (2a Ed.) 


2010, XVI+614 pp, ISBN 978-88-470- 1645-3 


C. D’Angelo, A. Quarteroni 
Matematica Numerica — Esercizi, Laboratori e Progetti 
2010, VIII+374 pp, ISBN 978-88-470-1639-2 


V. Moretti 


Teoria Spettrale e Meccanica Quantistica - Operatori in spazi di Hilbert 


2010, XVI+704 pp, ISBN 978-88-470-1610-1 


C. Parenti, A. Parmeggiani 
Algebra lineare ed equazioni differenziali ordinarie 
2010, VIII+208 pp, ISBN 978-88-470-1787-0 


B. Korte, J. Vygen 
Ottimizzazione Combinatoria. Teoria e Algoritmi 
2010, XVI+662 pp, ISBN 978-88-470-1522-7 


D. Mundici 
Logica: Metodo Breve 
2011, XII+126 pp, ISBN 978-88-470-1883-9 


E. Fortuna, R. Frigerio, R. Pardini 
Geometria proiettiva. Problemi risolti e richiami di teoria 
2011, VII+274 pp, ISBN 978-88-470-1746-7 


C. Presilla 
Elementi di Analisi Complessa. Funzioni di una variabile 
2011, XII+324 pp, ISBN 978-88-470- 1829-7 


54, 


D0; 


56. 


5, 


58. 


See 


60. 


61. 


62. 


63. 


L. Grippo, M. Sciandrone 
Metodi di ottimizzazione non vincolata 
2011, XIV+614 pp, ISBN 978-88-470-1793-1 


M. Abate, F. Tovena 
Geometria Differenziale 
2011, XIV +466 pp, ISBN 978-88-470-1919-5 


M. Abate, F. Tovena 
Curves and Surfaces 
2011, XIV+390 pp, ISBN 978-88-470- 1940-9 


A. Ambrosetti 
Appunti sulle equazioni differenziali ordinarie 
2011, X+114 pp, ISBN 978-88-470-2393-2 


L. Formaggia, EF Saleri, A. Veneziani 
Solving Numerical PDEs: Problems, Applications, Exercises 
2011, X+434 pp, ISBN 978-88-470-2411-3 


A. Machi 


Groups. An Introduction to Ideas and Methods of the Theory of Groups 


2011, XIV+372 pp, ISBN 978-88-470-2420-5 


A. Pascucci, W.J. Runggaldier 


Financial Mathematics. Theory and Problems for Multi-period Models 


2011, X+288 pp, ISBN 978-88-470-2537-0 


D. Mundici 
Logic: a Brief Course 
2012, XII+124 pp, ISBN 978-88-470-2360-4 


A. Machi 
Algebra for Symbolic Computation 
2012, VIII+174 pp, ISBN 978-88-470-2396-3 


A. Quarteroni, F. Saleri, P. Gervasio 
Calcolo Scientifico (5a ed.) 
2012, XVIII+450 pp, ISBN 978-88-470-2744-2 


A. Quarteroni 
Modellistica Numerica per Problemi Differenziali (5a ed.) 
2012, XVIII+628 pp, ISBN 978-88-470-2747-3 


64. V. Moretti
Spectral Theory and Quantum Mechanics
With an Introduction to the Algebraic Formulation
2013, XVI+728 pp, ISBN 978-88-470-2834-0


65. S. Salsa, F.M.G. Vegni, A. Zaretti, P. Zunino
A Primer on PDEs. Models, Methods, Simulations
2013, XIV+482 pp, ISBN 978-88-470-2861-6


66. V.I. Arnold
Real Algebraic Geometry
2013, X+110 pp, ISBN 978-3-642-36242-2


67. F. Caravenna, P. Dai Pra
Probabilità. Un'introduzione attraverso modelli e applicazioni
2013, X+396 pp, ISBN 978-88-470-2594-3


68. A. de Luca, F. D’Alessandro
Teoria degli Automi Finiti
2013, XII+316 pp, ISBN 978-88-470-5473-8


69. P. Biscari, T. Ruggeri, G. Saccomandi, M. Vianello
Meccanica Razionale
2013, XII+352 pp, ISBN 978-88-470-5696-3


70. E. Rosazza Gianin, C. Sgarra
Mathematical Finance: Theory Review and Exercises. From Binomial
Model to Risk Measures
2013, X+278 pp, ISBN 978-3-319-01356-5


71. E. Salinelli, F. Tomarelli
Modelli Dinamici Discreti (3a Ed.)
2014, XVI+394 pp, ISBN 978-88-470-5503-2


72. C. Presilla
Elementi di Analisi Complessa. Funzioni di una variabile (2a Ed.)
2014, XII+360 pp, ISBN 978-88-470-5500-1


73. S. Ahmad, A. Ambrosetti
A Textbook on Ordinary Differential Equations
2014, XIV+324 pp, ISBN 978-3-319-02128-7


74. A. Bermúdez, D. Gómez, P. Salgado
Mathematical Models and Numerical Simulation in Electromagnetism
2014, XVIII+430 pp, ISBN 978-3-319-02948-1


75. A. Quarteroni
Matematica Numerica. Esercizi, Laboratori e Progetti (2a Ed.)
2013, XVIII+406 pp, ISBN 978-88-470-5540-7


76. E. Salinelli, F. Tomarelli
Discrete Dynamical Models
2014, XVI+386 pp, ISBN 978-3-319-02290-1


77. A. Quarteroni, R. Sacco, F. Saleri, P. Gervasio
Matematica Numerica (4a Ed.)
2014, XVIII+532 pp, ISBN 978-88-470-5643-5


78. M. Manetti
Topologia (2a Ed.)
2014, XII+334 pp, ISBN 978-88-470-5661-9


79. M. Iannelli, A. Pugliese
An Introduction to Mathematical Population Dynamics. Along the trail
of Volterra and Lotka
2014, XIV+338 pp, ISBN 978-3-319-03025-8


80. V. M. Abrusci, L. Tortora de Falco
Logica. Volume 1
2014, X+180 pp, ISBN 978-88-470-5537-7


81. P. Biscari, T. Ruggeri, G. Saccomandi, M. Vianello
Meccanica Razionale (2a Ed.)
2014, XII+390 pp, ISBN 978-88-470-5725-8


82. C. Canuto, A. Tabacco
Analisi Matematica I (4a Ed.)
2014, XIV+508 pp, ISBN 978-88-470-5722-7


83. C. Canuto, A. Tabacco
Analisi Matematica II (2a Ed.)
2014, XII+576 pp, ISBN 978-88-470-5728-9


84. C. Canuto, A. Tabacco
Mathematical Analysis I (2nd Ed.)
2015, XIV+484 pp, ISBN 978-3-319-12771-2


85. C. Canuto, A. Tabacco 
Mathematical Analysis II (2nd Ed.) 
2015, XII+550 pp, ISBN 978-3-319-12756-9


The online version of the books published in this series is available at 
SpringerLink. 

For further information, please visit the following link: 
http://www.springer.com/series/5418 


